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Preface to the Corrected Third Printing 



As a third edition of this book has been called for, we have taken the op- 
portunity of making a few corrections and additions throughout the text. 
Bibliographical Notes have been left unchanged. 

Our thanks are due to a number of our collegues and students for pointing out 
errors and misprints, and in particular we are grateful to Prof. D. Schliiter of 
the Christian-Albrechts-Universitat Kiel for the trouble which he has taken 
in supplying us with a somewhat thorough list. 

Rome, July 2003 Dino Boccaletti 

Giuseppe Pucacco 




Preface to the First Edition 



Half a century ago, S. Chandrasekhar wrote these words in the preface to his 
celebrated and successful bookd 

In this monograph an attempt has been made to present the theory of stellar dy- 
namics as a branch of classical dynamics - a discipline in the same general category 
as celestial mechanics. [. . .] Indeed, several of the problems of modern stellar dy- 
namical theory are so severely classical that it is difficult to believe that they are 
not already discussed, for example, in Jacobi’s Vorlesungen. 

Since then, stellar dynamics has developed in several directions and at var- 
ious levels, basically three viewpoints remaining from which to look at the 
problems encountered in the interpretation of the phenomenology. Roughly 
speaking, we can say that a stellar system (cluster, galaxy, etc.) can be con- 
sidered from the point of view of celestial mechanics (the A/'-body problem 
with ^ 1), fluid mechanics (the system is represented by a material con- 
tinuum), or statistical mechanics (one deflnes a distribution function for the 
positions and the states of motion of the components of the system). 

The three different approaches do not of course exclude one another, and 
very often they coexist in the treatment of certain problems. It may sound 
obvious if we state that the various problems are reduced and schematized 
in such a way that they can be looked at from one of the above viewpoints 
and with the tools that can be provided by the relevant discipline. However 
paradoxical it might appear (given the enormous amount of work produced by 
mathematicians on the A/^-body problem), it is our opinion that it is the first 
kind of approach which has received the least attention from the researchers 
on stellar dynamics or, at least, has received much less attention than the 
progress in the held could allow. If, from the publication of Chandrasekhar’s 
book up to the present this has indeed happened it is due (in our opinion) 
mainly to two things. 

The first is to do with the belief that the results of celestial mechanics 
always refer to only a few bodies and therefore cannot be applied to stellar 
dynamics (many bodies); in more concrete terms the situation has been such 
that the dialogue between those who have dealt with the problems of celestial 

^ S. Chandrasekhar: Principles of Stellar Dynamics (University of Chicago Press, 
1942; reprinted by Dover, New York, 1960), p. VII. 
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mechanics (the mathematicians) and those who have dealt with the problems 
of stellar dynamics (the astrophysicists) has been minimal. We shall come 
back to this last point, later on. 

The second and much more recent, concerns the ever-growing and some- 
how overwhelming use of computers. In the last decade, in particular, nu- 
merical simulations have a more and more important position with respect 
to analytical studies. Doubtless computers are an exceedingly powerful tool 
for investigating certain problems. However, in our opinion they should be 
used to single out those points on which to concentrate analytical study rather 
than as a short cut to avoid it. That is, computers should promote analytical 
study rather than replace it. This was the purpose of the well-known paper by 
Henon and Heiles^, which has opened up new horizons to a branch of math- 
ematical physics. In addition, as proof of how essential the above-mentioned 
dialogue is, the paper deals with a stellar dynamical subject. 

Furthermore, we are convinced that knowledge of the problems of the ce- 
lestial mechanics (at a non-elementary level) is indispensable for those dealing 
with stellar dynamics: moreover, we think that there should not be a sharp 
boundary between the two disciplines. This led us to make the whole area the 
subject of a single book, albeit in two volumes. Our purpose is therefore to 
provide researchers in astronomy and astrophysics with an as thorough and 
clear an exposition as possible of the problems which constitute the founda- 
tions of celestial mechanics and stellar dynamics. It is therefore intended that 
for the latter the chosen approach is the first of those listed above. 

There is now a general agreement that “mathematical” and “physical” 
cultures are quite distinct and that they also have difficulties, sometimes, in 
understanding each other. In our opinion, this situation, owing to the ever- 
increasing specialization of scientific learning, causes damage that is partic- 
ularly severe in the field of astronomy and astrophysics. 

Whereas in the past (we are speaking of a “golden age” that ended in the 
1920s) the astronomer and the astrophysicist could take advantage of current 
work in mathematics and physics, today this is not only impossible but even 
unthinkable. The university education of astronomers and astrophysicists is 
overwhelmingly of the “physical” type: the mathematical tools acquired are 
inadequate for tackling the reading of any mathematical paper whatsoever 
(of mathematical physics, the theory of differential equations, etc.), which 
turns out to be necessary in some research. This is also because there is 
an irresistible tendency for everybody (and therefore mathematicians also) 
to retire into their own special language. Astronomers and astrophysicists, 
owing to the nature of the things they are dealing with, continually need 
to resort to results obtained by physicists and mathematicians. In the latter 
case, for the reason given above, that turns out to be exceedingly difficult 
and sometimes impossible. It is obvious, and we are convinced of this, that 

^ M. Henon, C. Heiles: The applicability of the third integral of motion: some nu- 
merical experiments, Astron. J. 69 , 73-79 (1964). 
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IX 



the times and cultural environments in which personalities like Poincare, 
Jeans or Eddington were present cannot come back again. However, we are 
also convinced that one can and must do something to better the present 
situation. 

Our aim in planning, and in writing, this book has been to contribute to 
ferrying from the “mathematical” side to the “astronomical-astrophysical” 
side some of the results achieved in the last few decades, which we con- 
sider essential to anyone dealing with solar system, stellar systems, galactic 
dynamics, etc. It is clear that in an operation of this kind it may happen that 
some of the things to be ferried fall overboard, whether the boat was over- 
loaded or the boatman not expert enough: we hope, however, to have kept 
the losses within acceptable limits. To continue with the methaphor, to the 
people living on the side at which the boat lands, we assume preparation 
to the intermediate graduate level (calculus, differential equations, vector 
calculus, . . .). 

We have done our best to provide a self consistent treatment, at least at 
a first level of understanding, to spare the reader continuous jumps from one 
textbook to another; at the same time we have also endeavoured to facilitate 
the deepening of individual arguments supplying indispensable information, 
including bibliographic details. 

The point of view we have assumed is that of discussing the problems 
and not of going into the details of different applications: we have tried to 
single out the fundamental problems (i. e. the mathematical models) and to 
present them in as clear and readable a way as possible for a reader having 
the mathematical background assumed above. We also consider the reader 
to be fully acquainted with celestial mechanics at undergraduate level, to the 
extent that can be obtained, for instance, from an excellent book such as 
Danby’s^. 

By tradition, the old textbooks on celestial mechanics used to include a 
chapter devoted to Hamiltonian mechanics, an indispensable tool for pertur- 
bation theory. We have not escaped from the tradition and the first volume 
includes a chapter devoted to selected topics of dynamics and dynamical sys- 
tems. The second chapter, devoted to the two-body problem, is not meant to 
replace traditional expositions (which are assumed known to the reader) but 
simply to emphasize features of the problem which can prompt further de- 
velopments. The third and the fourth chapter (the Wbody problem and the 
three-body problem) follow on in the same spirit, giving much space to results 
so far to be found only in the original papers. The fifth chapter, to our mind, 
is intermediate between celestial mechanics and stellar dynamics as usually 
agreed upon. In all four chapters (from the second to the fifth), besides some 
novelties (we believe) in the planning of the material and the exposition of 
recent results, classical arguments sanctioned by tradition remain. For the 

^ J. M. A. Danby: Fundamentals of Celestial Mechanics^ 2nd Revised & Enlarged 
Edition (Willmann-Bell, Richmond, 1988). 




X Preface to the First Edition 

latter, we have sometimes drawn our “inspiration” from the expositions of 
authors whose works can now be considered “classics” and whom the reader 
will found mentioned in the notes to each chapter. 

In the second volume the first three chapters are devoted to the theory 
of perturbations, starting from classical problems and arriving at the KAM 
theory and to the introduction of the use of the Lie transform. A whole chap- 
ter treats the theory of adiabatic invariants and its applications in celestial 
mechanics and stellar dynamics. Also the theory of resonances is illustrated 
and applications in both fields are shown. Classical and modern problems 
connected to periodic solutions are reviewed. The description of modern de- 
velopements of the theory of Chaos in conservative systems is the subject 
of a chapter in which is given an introduction to what happens in both 
near-integrable and non-integrable systems. The invaluable help provided by 
computers in the exploration of the long time behaviour of dynamical sys- 
tems is acknowledged in a final chapter where some numerical algorithms and 
their applications both to systems with few degrees of freedom and to large 
A'-body systems, are illustrated. 
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Introduction — The Theory of Orbits 
from Epicycles to “Chaos” 



The subject of this book is the study of the orbits followed by a body (mass 
point) subjected to the gravitational attraction of a given number of other 
bodies (mass points). We shall start from the minimum number (the two- 
body problem) and proceed to the case where it is convenient to represent 
the action exerted by a great number of bodies by means of a mean poten- 
tial. Although the concept of the orbit as a continuous line drawn in three- 
dimensional space by the subsequent positions of a moving mass point may 
appear elementary and hence be considered a primitive notion which should 
only require intuition, nevertheless it has, over the centuries, undergone some 
kind of evolution. In view of future developments, we shall choose as a start- 
ing point for the evolution of the concept of the orbit the formulation given 
by the Greeks in the third century B.C. 



Prom the Theory of Epicycles to Newton 



Although historians of ancient astronomy have evidence of a notable devel- 
opment in astronomy among both the Egyptians and the Babylonians, it was 
only among the Greeks, and in a sophisticated mathematical environment, 
that the theories were born which, with some adjustment and improvement, 
would dominate for about two thousand years, i.e. up to the Copernican 
revolution. 

The Greeks formalized their ideas about the motion of celestial bodies 
in the theory of eccentrics and in the theory of epicycles. The fundamental 
fact to keep in mind is that according to the Greeks the celestial bodies 
could not move on any kind of curve unless it was a circle (the perfect curve 
and therefore the only one worthy of a celestial body). This implied that 
any motion could only be either a uniform circular motion or a combination 
of uniform circular motions. We should also add that, in so doing, they re- 
nounced the possibility of considering seriously the problem of investigating 
the true nature of the physical system of the world, namely of finding the 
causes of the motion of celestial bodies. The various mathematical elabora- 
tions were basically oriented towards the description of motions. Through 
a crude schematization, we can say that from Hipparchus (second century 
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B.C.), through Ptolemy (second century A.D.), to Newton (the Principia 
was published in 1687) attention has been given mainly to the kinematics 
and not to the dynamics of celestial bodies. 

The theory which almost certainly appeared first is that of the eccentrics. 
At the basis of this theory the Earth is situated at the centre of the universe 
and the Moon rotates around it on a circular orbit with a period of 27 days; 
the Sun also rotates around the Earth with a period of one year; the centre 
of these orbits does not coincide with the position of the Earth. The inner 
planets, Mercury and Venus, move on circles whose centres are always on the 
line joining the Sun and the Earth; the Earth remains outside these circles. 
The outer planets (Mars, Jupiter, Saturn) also move on circles whose centres 
are on the line joining the Earth and the Sun, but these circles have so great 
a radius as to always encircle both the Earth and the Sun. This theory was 
employed by Hipparchus to represent the motion of the Sun. 

The eccentric (see Fig. 1) is a circle whose centre (C) does not coincide 
with the position of the observer^ on the Earth {E). If the Sun {S) moves 
with uniform motion on this circle, and hence the angle ACS increases 
uniformly, it will not be the same for the angle AES: this will increase 
more slowly when S is close to A (the apogee, or farthest point from the 
Earth) and more quickly when S is close to B (the perigee, or closest point 
to the Earth). 





The motion of the Sun could equally be obtained by means of the epicycle 
model, which dates back to Apollonius (end of the third century B.C.). In 
such a model, the body whose motion must be represented is considered to 
be moving uniformly on a circle (epicycle). The centre of the epicycle moves 
uniformly on a second circle called the deferent. To demonstrate the said 
equivalence, we shall refer to Fig. 2. If one takes as the deferent a circle equal 
to the eccentric but with its centre at E (the dashed circle) , and if one takes 
the point S' on it such that E S' is parallel to C 5, then S' S is equal 
and also parallel to EC. Then the Sun S', which moves uniformly on the 

^ As the observer is in a geocentric system, motions should be reproduced as seen 
from the Earth. 
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eccentric, can be considered in the same way to be in uniform motion on a 
circle of radius S S' whose centre S' moves uniformly on the deferent. In 
this case, therefore, the two methods lead to the same result. The arrows in 
Fig. 2 show that the motion on the epicycle must occur in a direction opposite 
to that of the motion on the deferent. 

Similar considerations apply for the motion of the Moon, although in this 
case the theory turns out to be very rudimentary and fails to give an account 
of the greater complexity of the lunar motion compared to the solar one. 
Hipparchus found a way to overcome these difficulties by taking an eccentric 
whose centre describes a circle around the Earth in a period of nearly 9 years 
(corresponding to the motion of the apses). The epicycle theory was later 
improved by using as a deferent an eccentric circle and by introducing a new 
point called the equant^ placed symmetrically to the Earth with respect to 
the centre of the deferent. This point replaced the centre of the deferent as 
a point from which to see the centre of the epicycle moving with constant 
angular velocity. Somehow, this violated the rule that the considered circular 
motions should be uniform. Later still, the theory was improved by adding 
secondary epicycles, epicycles which had an epicycle as a deferent. 

We shall not deal with this matter further, since our purpose has been 
only to capture the spirit of the epicycle theory. It would be the basis of the 
Ptolemaic system which would reach, substantially unchanged in its funda- 
mental structures, the age of the Renaissance and beyond. It is well known, 
in fact, that, during the thirteen centuries following the composition of the 
Almagest^ the models worked out by Ptolemy were modified and improved 
several times, particularly by the Arabian astronomers, and therefore instead 
of the Ptolemaic system one should speak of Ptolemaic systems; however, the 
basic concepts have never been a matter of controversy. The Earth was im- 
mobile at the centre of the world and all the other bodies moved around 
it with motions that, however complicated, would have to be explained as 
suitable compositions of circular motions. 

The idea of circular motion continued to obsess even Copernicus, who with 
his work Revolutionibus orbium coelestium” (published in Nuremberg in 
1543, the year of his death) founded the new cosmology and established a 
new theory of planetary motion. The title of Chap. 4 of the first book of De 
Revolutionibus in fact reads “The reason why the motion of celestial bodies 
is uniform circular and perpetual or composed of circular motions” , and later 
on, with the need to employ rectilinear motion, he deemed it necessary to 
demonstrate that a rectilinear motion could be generated by the composition 
of two circular motions. By moving the centre of the system from the Earth 
to the Sun, Copernicus succeeded in obtaining for all the planets, as for 
the Earth, quite simple orbits (circular in a first approximation) and no 
longer curves like that of Fig. 3, which represents the orbits (seen from the 
Earth!) obtained from the epicycle theory. Copernicus granted to the Earth, 



^ Nikolaus Kopernicus: Gesamtausgabe (Munich, 1949). 
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nevertheless, a prime role, by assuming in his planetary theories that the 
centre of the terrestrial orbit, rather than the bary centre of the Sun-planet 
system, was the centre of every motion. Of course, this led to mistakes which 
still required the use of epicycles as correctives in order to be in agreement 
with the results of the observations (albeit still rather inaccurate). This, 
however, did not worry the astronomers, who were used to a veritable crowd 
of epicycles; in fact, Copernicus, in that little work known as Commentariolus^ 
which he had circulated handwritten and which seems to precede the editing 
of the De Revolutionibus^ concluded almost in triumph: “And in this way. 
Mercury moves altogether by seven circles, Venus by five, the Earth by three 
and, around it, the Moon by four; at last Mars, Jupiter and Saturn, each by 
five. As a consequence, 34 circles are sufficient to explain the entire structure 
of the universe, as well as the dance of the planets”. Notwithstanding the 
error of having assumed the centre of the terrestrial orbit to be the centre of 
every motion (an error which, as later emphasized by Kepler, is considerable 
in the case of Mars’s motion), Copernicus marked the beginning of a new 
era for astronomy and for the determination of the orbits of celestial bodies. 
The world had to wait a little less than 70 years from the publication of De 
Revolutionibus to reach, with Kepler, the very high point of what we have 
called “the kinematics of celestial bodies” and at the same time the sunset 
of the epicycles. 

However, the disappearance of the epicycles was only temporary: some 
time later, like the Phoenix, they rose again from their ashes. In fact, as 
remarked (perhaps for the first time) by G. V. Schiaparelli in the last century, 
Fourier series expansions brought the epicycles^ back again, in modern dress, 
in celestial mechanics. 

Between 1609 and 1619 Kepler announced those results which have been 
known ever since as his “three laws” . Although he also addressed his attention 
to the causes of the motion, the results in this field had never been very 
positive (for instance, he ascribed the motion of the planets around the Sun 
to a magnetic force different from gravity). Prom our point of view, therefore, 
it is convenient to take into consideration Kepler’s work solely as the end point 
of the kinematics of celestial bodies: it is from Kepler’s three laws that Newton 
could eventually deduce the law of universal gravitation, and thereafter it 
became truly possible to speak of the dynamics of celestial bodies. 

Although according to some historians the first intuition of the law of 
universal gravitation dates back to as early as 1666, when Newton was only 
24, it is twenty years afterwards that Newton presented to the Royal Society 
of London a copy of the paper that would be printed one year later (1687) in 
its ultimate form and with the title “Naturalis Philosophiae Principia Math- 

^ See S. Sternberg: Celestial Mechanics^ Part I (W. A. Benjamin, 1969), Chap. 1; and 
D. G. Saari: A visit to the Newtonian A-body problem via elementary complex 
variables. Amer. Math. Monthly 97 , 105-119 (1990). 
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ematica”. Newton’s law^ is deduced from Kepler’s three laws and Kepler’s 
three laws are deduced from Newton’s law. With Newton, not only was a 
kinematics of the orbits made available, but at last also a quite general phys- 
ical law, applicable according to the particular problem of interest. Newton’s 
law refers to the gravitational interaction of two bodies (to be considered 
either mass points or with a spherical mass distribution), and thence if we 
apply it to the planetary motion we must refer to the ideal case, which implies 
only the Sun and the planet under examination: the two-body problem. 

Any deviation from the calculated Keplerian orbit has to be attributed to 
the action of the other bodies of the solar system; action which in principle 
can be evaluated, since we are able to express through Newton’s law the force 
acting between any two bodies: the N-body problem. 

The substantial change which occurred with respect to what was done 
before Newton lies just in this: up to that time, every orbit, as seen from 
the Earth, was reproduced with some approximation by combining a certain 
number of circular motions whose presence was justified only by the results 
they enabled to be obtained and there was no aknowledgement whatsoever 
of the fact that the entire solar system was “held up” by the mutual interac- 
tions of the various components. With Newton, on the other hand, the orbit 
followed by each celestial body could be exactly calculated, in principle, if 
one were to take into account all the bodies involved and their mutual inter- 
actions. One was then in the position where the mathematical model could 
completely reproduce the physical phenomenon. Possible inadequacies did 
not derive from failures of the model, but on calculational difficulties. 

In the Principia^ Newton applied only geometrical methods and because 
of that the evaluation of the perturbation of the Keplerian orbits became 
particularly cumbersome. The most important case, of course, was the motion 
of the Moon, to which he devoted almost all of the third book as well as great 
energy, since it appears^ he told Halley that the problem of the motion of the 
Moon made his head ache and kept him awake so often that he would think 
of it no more. 

The influence of the Sun on the motion of the Moon around the Earth 
is so important that the problem of the lunar motion must be considered a 
three-body problem. Just to give some idea of how the perturbing force of 
the third body varies in a complicated manner, by using only geometrical 
tools a la Newton, let us consider the Earth-Moon-Sun system,^ as shown 
in Fig. 4 (where the radius of the lunar orbit - approximated by a circle - is 
exaggerated with respect to the distance from the Sun). In the figure, mi and 
m2 represent two positions of the Moon in its circular orbit around the Earth 
(E) and S represents the position of the Sun. The vectors miATi, m2 K2 and 

^ See, for instance Sir Isaac Newton: Principia, translated by Motte and revised by 
Cajori. (University of California Press, 1934). 

^ The quotation is extracted from F. R. Moulton: An Introduction to Celestial Me- 
chanics, 2nd edition (Macmillan, 1914), p. 363. 

® The example and the figure are taken from Moulton: op cit. p. 337. 
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EN give on suitable scales the acceleration on the Moon and on the Earth 
due to the action of the Sun. By decomposing miKi^ rri 2 K 2 in such a way 
that one of the components is equal and parallel to EN, one has the result 
that the perturbing acceleration due to the Sun is given in the two positions 
by mi Pi, and rri 2 P 2 respectively. The perturbing acceleration in P varies 
then from point to point and is directed toward the line ES, or its extension 
beyond E when the distance mS is greater than ES. 

The evaluations of the various effects that could be obtained through a 
study of the Newtonian geometrical constructions correspond roughly to the 
first order^ of the perturbative theories which would be developed in the 
following century. After Newton, almost only analytical methods would be 
applied, the geometrical, ones being reserved for elementary explanations of 
the causes of the various types of perturbation. 



The Theory of Orbits from Newton to Poincare 

From the third and last edition of the Principia (1726) to the publication of 
the Traite de Mecanique Celeste of Laplace (the first volume was published 
in 1799), the theory of orbits progressed significantly through the work of 
essentially five persons: Euler (1707-1783), Clairaut (1713-1765), D’Alembert 
(1717-1783), Lagrange (1736-1813) and Laplace himself (1749-1827). 

In a little more than half a century, one goes from the geometrical proofs 
of Newton to a generalized use of analytical tools which, very often, were 
created just to solve the problems of celestial mechanics (it is enough to 
mention the case of the method of variation of arbitrary constants in the 
solution of differential equations). We have chosen the work of Laplace as a 
means of marking the process of formal elaboration of Newtonian mechanics, 
even if this may appear objectionable from the point of view of the historian of 

^ For the motion of the lunar perigee, in fact, Newton had obtained a value which 
was only half that given by observations. In 1872, in his unpublished manuscripts, 
the exact evaluation was found in which Newton considered perturbations of the 
second order. 
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science, since in the last century a kind of general model for the formalization 
of a physical theory was identified in the Mecanique Celeste. 

In 1803, the third volume of the Mecanique Celeste^ dedicated to Napoleon 
Bonaparte, was published. Laplace wrote in the preface as follows (in trans- 
lation) : 

“We have given in the first section of this work the general principles of the equi- 
librium and the motion of the matter. Their application to the celestial motions led 
us, without hypotheses and through a series of mathematical arguments,^ to the law 
of universal gravitation, of which gravity and the motion of projectiles on the Earth 
are but particular cases. Then, taking into account a system of bodies submitted 
to this great law of the nature, we have reached, by means of a particular analysis, 
the general expressions of their motions, of their figures and of the oscillations of 
the fluids which cover them; expressions from which one has seen all the observed 
phenomena of the tides, of the variations of the degrees and the weight at the sur- 
face of the Earth, of the precession of equinoxes, of the libration of the Moon, of 
the figure and rotation of the Saturn rings, and their permanence on its equatorial 
plane derived. [. . .] In our century, the successors of this great man [Newton] have 
built up the construction of which he had laid down the foundations. They have im- 
proved the infinitesimal Analysis, they have invented the differential Calculus and 
the finite difference Calculus, and have turned into formulae the whole Mechanics. 
By applying these findings to the law of gravity, they have explained by this law 
all the celestial phenomena and given an unexpected accuracy to the theories and 
astronomical tables [. . .] It is mainly in the applications of the Analysis to the Sys- 
tem of the world where the power of this wonderful tool, without which it had been 
impossible to understand a mechanism as complicated in its effects as simple in its 
causes, is shown”. 

What might be called the “the Laplacian belief’ is encapsulated in the above 
quotation. All the motions of celestial bodies are explained “without hy- 
potheses and through a series of mathematical arguments”, starting from 
the general principles of mechanics. The “wonderful tool” of analysis is the 
weapon which allows one to win even the most difficult challenges of astron- 
omy. The future of any system of gravitating bodies can be exactly foreseen 
for eternity as long as we know its present state: it is merely a matter of 
calculation, that is of application of the “wonderful tool”. In fact, to quote 
Laplace,^ once more, this time from “Essai philosophique sur les Probabilites” 
(in translation): 

“We shall then imagine the present condition of the universe as the effect of its 
previous condition and as the cause of the forthcoming one. An intelligence which, 
at a given instant of time, would know all the forces which act on the nature and 
the natural conditions of the elements which compose it, if in addition he were 
sufficiently powerful to modelize these data with the Analysis, would include in the 
same formula the motions of the greatest bodies of the universe as well as those of 

^ The italics are ours. 

® For the quoted excerpts, see P. S. Laplace: Oeuvres Complies (Gauthiers-Villars, 
Paris, 1878). 
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the lightest atoms; nothing would be uncertain to him, and the future, as well as 
the past, would be present at his sight”. 

This “absolute” determinism of Laplace would later on find its expression and 
formal endorsement in the theorem of existence and uniqueness for ordinary 
differential equations. Before reaching this point, let us pause for a while in 
order to evaluate the situation. In the days when Laplace was writing, the 
gravitational constant was known with a certain accuracy (the Cavendish 
experiment in 1798 had given a result of 6.75; the error with respect to 
today’s measurements was about 1%); by knowing the masses involved in 
each problem, one could write the related system of differential equations. 
The immediately following step was to consider the problem as the sum of 
two others: two-body problem and a perturbation due to the action of every 
other body (in some cases only one). 

The solution of the first problem being known {the unperturbed problem) 
mathematical techniques had to be developed to handle the perturbation. 
Generally, an expansion in a power series of a small quantity (the eccentricity, 
the ratio of one of the masses and that of the Sun, etc.) was used and the 
problems of the convergence of the series were neglected. As a consequence, it 
was thought possible to approximate the “true” , physical orbit more closely 
the more series terms were taken into account. The fact that two new planets, 
Uranus (1781) and Neptune (1846), were discovered through the use of this 
kind of calculation only confirmed what we have called the “Laplacian Belief’ . 

Let us go back to the theory of ordinary differential equations, which we 
have already mentioned, to locate something very strange. In a number of 
books, mostly concerning dynamical systems, the existence and unique- 
ness theorem for ordinary differential equations is not only considered the 
formal expression of determinism in mechanics, but also appears to be so 
intimately connected with it that determinism could not even exist without 
that theorem. As a matter of fact, the two questions from the very beginning 
have grown independently from each other and only towards the end of 
the nineteenth century, when through the work of Peano the question of the 
uniqueness of the solution was clarified, could one begin to overlap the theo- 
rem and the deterministic principle so that one became the supporting arm 
of the other. In any case, the theory of orbits, by the theorem of existence 
and uniqueness for the solution of ordinary differential equations and the 
further theorem of continuous dependence of this on the initial conditions, 
achieved at a certain point the enviable condition of “perfect science” where 
the conformity of the mathematical model to the physical reality and the 
rigour of the mathematical tool gave to the predictions of celestial mechanics 
the features of infallibility and precision. 

^°See, for instance, V. I. Arnold: Mathematical Methods of Classical Mechanics 
(Springer, 1978). 

^^See G. Israel: II determinismo e la teoria delle cquazioni differenziali ordinarie^ 
PHYSIS 28 , 305-358, (1991). 
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In fact, once a given problem was set up in the correct manner (that is, 
the correct system of differential equations was written down), the theory 
guaranteed (apart from the existence or otherwise of singularities) that the 
solution existed because of the properties of the regularity of the functions 
representing the components of the given forces. Furthermore, the solution 
corresponding to given initial conditions, was also unique. Since the initial 
data were the result of measurement operations and therefore subject to 
errors, the corresponding solution consequently provided predictions subject 
to errors; the above theorem of the continuous dependence of the solution on 
the initial conditions also guaranteed that the error implied in the solution 
was small, provided the initial data measurement error was small. 

The state of affairs appeared to be such as to make Moulton write, 
about one century after Laplace: 

“At the present time Celestial Mechanics is entitled to be regarded as the most 
perfect science and one of the most splendid achievements of the human mind. No 
other science is based on so many observations extending over so long a time. In no 
other science is it possible to test so critically its conclusions, and in no other are 
theory and experience in so perfect accord”. 

But even at the time Moulton wrote this, the situation was not such as to 
justify that tone of triumph. From 1890,^^ when the memoire couronne of 
Poincare was published, the seeds had been spread of many plants destined 
to grow in our century, and some of them with plenty of thorns. 

Prom Poincare to the Deterministic Chaos: 

The Sensitive Dependence on the Initial Conditions 

Poca favilla gran fiamma seconda 
Dante: Paradis o I, 34 

We have already mentioned that, for the problems of celestial mechanics, a 
“strategy”, generically called perturbation theory, had been developed. Al- 
though this name covered and still covers very different theories, one feature 
is common to all these theories: each problem was solved by “perturbing” 
a problem whose solution was already known. The perturbation was repre- 
sented by a series expansion. These series, heirs, as we have already said, of 
the ancient epicycles, contained however, a lot of contradictions. First of all, 
the problem of convergence. In fact, although at the beginning the series were 
used freely without difficulty, at a certain point it began to be asked whether 
one was really dealing with convergent series. The answer was provided by 
Poincare, as already mentioned: in general, these series did not converge; 

^^See F. R. Moulton: op. cit. p. 430. 

Poincare: Sur le probleme des trois corps et les equations de la dynamique, 
Acta Mathematica 13 , 1-271 (1890). 
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as was later clarified, they converged only for some frequencies. A second 
problem was given by the use of approximated expressions obtained by trun- 
cating the expansions: they contained “secular” or “mixed” terms, that is, 
terms proportional to t (the variable indicating the time) or to t multiplied 
by a periodic term. In both cases, it was clear that these expressions would 
lose any meaning if they were used over a fairly long interval of time. 

A.Lindsted (1882) found the method to overcome this inconvenience. 
Starting from the observation that the non-linearity of the equation involved 
certainly modified the frequency ujq of the unperturbed problem, he ex- 
pressed the variable t through a new independent variable multiplied by a 
power series (of a “small” quantity e), where the coefficients were given by 
different frequencies (o;i, cj 2 , • • • , etc.). By fixing afterwards, at each per- 
turbative order, the value of the corresponding uji in a suitable manner, he 
eliminated the terms having t as a factor. 

Besides the presence of secular terms, the perturbative series were affected 
by another pathology: small divisors. For systems of two or more degrees 
of freedom, even at the first perturbative order, linear combinations of the 
frequencies, which could take a very small value, close to zero, appeared in 
the denominators of the terms of the expansion. When these divisors were as 
small as zero, the frequencies were “in resonance” . 

It is clear from what has been said so far that the perturbation theory, 
notwithstanding its successes, was close to entering into a deep crisis. This 
had also an impact on the old problem of the stability of the solar system, 
which had been formalized in the answer to a prize question which the King 
of Sweden had proposed in 1885 (the prize was awarded to H. Poincare) as 
follows: For an arbitrary system of mass points which attract each other ac- 
cording to Newton’s laws, assuming that no two points ever collide, give the 
coordinates of the individual points for all time as the sum of a uniformly 
convergent series whose terms are made up of known functions. 

The situation was eased by A. N. Kolmogorov, who proposed in 1954 a 
theorem, later on demonstrated by V. I. Arnold and J. Moser and ever since 
universally known as the “KAM theorem”, where the conditions which had 
to be verified in order that a motion undergoing a “small perturbation” could 
preserve the features of the unperturbed motion were laid down. As we shall 
see in Chap. 1, if one considers the phase space of a system with n degrees of 
freedom, for an integrable system one has that the trajectories of the repre- 
sentative point are constrained to remain on the surface of an n-dimensional 
torus defined by n constants of motion of the system itself. 

The “KAM theorem” states that, under suitable conditions the perturba- 
tions do not “destroy” the tori but only deform them, and then the trajec- 
tories are still constrained to wind around n-dimensional surfaces which are 

^"^A. N. Kolmogorov: Doklady Akad. Nauk. 98, 527 (1954). English translation 
in Stochastic Behavior in Classical and Quantum Hamiltonian Systems ed. by 
G. Casati, J. Ford (Springer, 1979), pp. 51-56. 
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close to the initial tori. The conditions required by the theorem are however 
very stringent; besides allowing only very small perturbations and excluding 
the cases of resonance, the theorem also established how far from possible 
resonances we must remain. The KAM theorem is not applicable, therefore, 
to the solar system, where various cases of resonance occur; nevertheless its 
appearence brought new life to studies of the perturbation theories and once 
again caught the attention of mathematical physicists to celestial mechanics. 
Almost at the same time as the demonstration of the KAM theorem, a fur- 
ther topic appeared, which has raised an enormous interest in the last few 
years under the appealing title of “deterministic chaos”. The development 
and the improvement of computing systems, in the 1960s, made available 
calculations which up to that time were not feasible. It was then possible to 
develop a method for studying the non-integrable systems already suggested 
by Poincare and Birkhoff: the “surfaces of section”. Without going into the 
details, which will be described in Volume 2, let us imagine drawing in phase 
space a surface which intersects the trajectories of the representative points 
of the system. If the system is integrable, and consequently the trajectories 
are wrapped around the n-dimensional tori of which we have already made 
mention, the intersection with the above surface shall give as a result points 
(more and more numerous with increasing time) which will lie along well- 
defined curves (intersections of the tori with the surface). Obviously, in the 
case of closed orbits (simply periodic motions) one will have as intersections 
isolated points that cross at each period. 

In 1964, M. Henon and C. Heiles^^ investigated by this method the motion 
of a star in a galaxy, representing the gravitational attraction of the galaxy by 
a potential having cylindrical symmetry. The chosen potential was given by 
the potential of a planar oscillator to which two terms were added (of third 
degree in the coordinates). Those terms made the problem non-linear and 
non-integrable. Henon and Heiles found that, for very low values of the total 
energy (the minimum value chosen by them was E = 1/12), the system was 
very close to being integrable, since the vast majority of intersection points 
were placed along well-defined curves corresponding to the intersections with 
the tori. By raising the energy, the regular curves progressively disappeared, 
through a phase where they were reduced to tiny isolated zones. For £* = 1/6, 
in practice the whole available area (with the exception of extremely small 
“islands”) was filled thickly by points generated by a single trajectory (see 
Figs. 5, 6 and 7). 

The result of Henon and Heiles emphasized two features. The first was 
that the system was “more integrable” than one would expect from the use 
of purely analytical methods: in fact, at low energies, the system was “practi- 
cally” integrable. Numerical methods could then provide much help in guess- 
ing if a system was very close to the condition of integrability or not (it was 
this belief, among others, that gave reason for such research). The second fea- 

Henon, C. Heiles: See Footnote 2 in the Preface. 
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Fig. 6 



Fig. 7 



ture was that the more the energy increased and the system departed from 
integrability, the more the motion became “chaotic” . 

We shall now try to explain the meaning of this adjective. We have already 
recalled the theorem of the continuous dependence of a solution (in our case 
a trajectory) on the initial data. Quantitatively, if Xi (t) and yi (t) are two 
solutions of the system x = X{x,t) defined in the closed interval [to, ti] 
and K is the constant introduced in the Lipschitz condition (see Footnote 1, 
Sect. 1.1), then 

I Xi (t) -yi{t)\ < I Xi (to) - Vi (to) I . 

This inequality does not prevent two curves which were close at the time 
t = to exponentially diverging afterwards within a very short time; for some 
systems this is what really occurs, and is what is called a sensitive dependence 
on initial conditions. Until recently, physicists thought that this could only 
occur in very particular cases (unstable equilibrium positions, etc.). What is 
new is the realization that in some systems one has sensitive dependence on 
initial conditions, whatever the initial conditions, that is, the dependence is 
not the exception but the rule. In this case, one speaks of chaotic motions 
and deterministic chaos. 

In the Henon-Heiles problem, the system, as the energy grows, becomes 
a (partially) chaotic system: the intersections of the trajectories with the 
surface of section thickly fill the area determined by energy conservation, but 
points corresponding to trajectories originating in “close” points are here 
very “far”. The exponential divergence of the trajectories characterizes the 
chaotic motion; in spite of this, the chaos is “deterministic” , in the sense that 
the divergence of the trajectories is “determined” by the initial conditions 
according to the equations of the system. This “determinism” , however, does 
not help at all in predicting the system’s behaviour, because in practical 
cases one has to deal with experimental errors. With regard to this, we cite 
what J. Hadamard wrote in 1898; together with Poincare, he can be rightfully 
considered the forerunner of those who characterize the present trend: 

^^See J. Hadamard: Les surfaces a courbures opposees et leurs lignes geodesiques, 
J. Math, et Appl. 4, 27-73 (1898). 
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“Certainly, when a system moves under the action of given forces and the initial 
conditions of the motion have given values, in the mathematical meaning of the 
word, the subsequent motion and, as a consequence, its behaviour, when t increases 
indefinitely are for these very reasons known. However, in the astronomical prob- 
lems, it will not be like that: the constants which define the motion are physically 
given, that is with some errors whose magnitude is reduced as long as the power of 
our observation means increases, but which is impossible to cancel. If one follows 
the trajectories only for a given time, on the other hand arbitrary, one could think 
that the errors on the initial conditions have been reduced to such an extent that 
they cannot appreciably modify the shape of these trajectories during the time 
interval under consideration. [. . .] it is not by all means correct to draw a similar 
conclusion regarding the final behaviour of the same curves. This may very well 
depend [. . .] on arithmetical discontinuity properties of integration constants” (in 
translation). 

If we consider that even the simplest systems (and with only two degrees of 
freedom) are in general mixed systems, that is, they can have both regular 
solutions and chaotic behaviour, then we realize that the Laplace universe, 
this gigantic and perfect watch, has disappeared for ever in the haze of the 
past. But if, as Poincare states, “. . . the real aim of Celestial Mechanics is 
not to calculate the ephemerides, because one could be satisfied of a short 
term forecast, but to recognize if the Newton law is sufficient to explain all 
the phenomena”, the task of determining the orbits of celestial bodies still 
remains to challenge the researcher. 

Unlike Hipparchus and Ptolemy, we are now able to calculate ephemerides 
with great accuracy, to foresee when and at which point of a planet a space 
probe will land, but we do not yet know if the solar system (or, better, its 
Newtonian model) is stable, nor are we able to foresee if one of the thousands 
of asteroids which fill it will some day end up hitting the Earth. There are 
still many questions without answers. 



^^H. Poincare: Les Methodes nouvelles de la Mecanique Celeste (Gauthiers-Villars, 
Paris, 1892) - Vol. I, p. 5. 




Chapter 1 

Dynamics and Dynamical Systems — 
Quod Satis 



In some cases it is easier to describe what something is not, rather than what 
it is or what it wants to be. We shall therefore adhere to this principle when 
introducing this chapter. 

First of all, this chapter is not a summary of the subjects mentioned in 
the title. Only a few topics from mechanics appear, and furthermore they 
are reviewed while we constantly keep in mind the problem of integrability 
and the search for first integrals. If only a selection of topics from mechanics 
is given, and moreover aimed at a specific goal, still less is said about dy- 
namical systems and stability, in fact the minimum sufficient to make a few 
indispensable concepts familiar. Obviously, one could argue that the reader 
could find elsewhere the subjects explained here, by consulting various text- 
books on analytical dynamics, dynamical systems and some review articles. 
It was important to us, however, that some subjects were explained in a par- 
ticular manner, with certain features emphasized and consequently treated in 
such a fashion as to be applicable according to a specific conceptual scheme. 
In essence, we have committed ourselves to providing the reader with those 
tools, and only those, indispensable to the understanding of subsequent chap- 
ters. The sentence Quod Satis also stems from here, as in ancient recipes: the 
ingredients are mixed, we hope, in sufficient quantities for the aim to be sat- 
isfied. We have also tried to emphasize clarity without sacrificing rigour too 
much. Furthermore we have sought to maintain the language at a “moderate” 
level, so as not to discourage the reader who is not too keen on appreciating 
the charms of abstraction. 
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A. Dynamical Systems and Newtonian Dynamics 
1.1 Dynamical Systems: Generalities 



In this book we shall deal mainly with Hamiltonian dynamical systems, which 
represent a very peculiar class of continuous dynamical systems. Therefore, 
instead of starting from the definition of abstract dynamical systems, we shall 
limit ourselves to the case that will be the object of our applications. Conse- 
quently, we shall define “dynamical system” , a set of m first-order differential 
equations in the variables xi, X2, • • • , 

dx ■ 

-^ = Xi{xi,X 2 ,... 1 = 1,2,... ,m, (l.A.la) 



or, in vector notation: 

X — X{x,t). (l.A.lb) 

In (l.A.la), the independent variable t represents the time and one as- 
sumes that the m functions Xi are defined in a suitable m -h 1-dimensional 
domain and there satisfy the conditions under which the theorem of ex- 
istence and uniqueness of the solution holds. ^ The m-dimensional space 
(xi,X2, . . . ^Xm) is called the phase space and, for the time being, coincides 
with the Euclidean space If the functions Xi, which define the vector 
field X, do not depend on time, the system is said to be autonomous. In 
general, a solution of the system (l.A.la) will be a vector x = x{xQ,tQ,t) 
verifying the initial condition x(xq,^o) = For an autonomous system, 
the translation property holds: if x{t) is a solution, then x{t — a), with a a 
constant, is also a solution. The proof is straightforward, because in this case 
the right-hand side of (l.A.lb) does not contain the time. In the case of an 
autonomous system, we can use one of the x^, for instance xi (if Xi 0), as 
the new independent variable, in place of t, and obtain the m — 1 equations 

dx2 _ X2 {x) dxs _ X3 (x) dXm _ Xm jx) A 

dxi Xi (x) ’ dxi Xi {x) ’ ' ’ ’ ’ dxi Xi (x) ' 



The solutions of the set (1.A.2) in phase space are called phase curves. If 
the existence and uniqueness theorem holds for the set (l.A.l), it will hold 
for (1.A.2) also; therefore the curves in phase space will not intersect. One 
must keep in mind that different solutions, x{t) and x{t — a), correspond to 
the same phase curve. 



^ The m functions Xi, . . . Xm must be continuous in all the m -h 1 variables in the 
assumed domain together with their partial derivatives with respect to the Xi. 
The last condition can be replaced by the weaker condition 



dX{x,t) 

dxj 



<K, 



with AT > 0, j = 1, 2, . . . , m 



or even \X{x,t) — X{y,t)\ < K\x — y\ (the Lipschitz condition). 
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If a point X — a exists for which Xi{a) = 0, then the point a = 
(ai,...,a^) is called a critical point or singular point. Obviously at such 
a point there is no possibility of reducing the system to a set of m — 1 equa- 
tions by eliminating the time, as we did in (1.A.2). A critical point can also 
be considered as a phase curve degenerated into a point. Moreover, being 
there = 0 (Vt), a critical point corresponds to an equilibrium solution] 
xi[t) = a^ (Vt) is a solution of the system and, for the theorem of existence 
and uniqueness, is the unique solution through a; thence, if y{t) is a distinct 
solution , it will be y{t) ^ a, Vt. However, it may happen that, for t — )* oo, 
some solution tends to an equilibrium solution; in this case the equilibrium 
solution is called an attractor. More precisely, a critical point x — a E is 
called a positive attractor if a neighbourhood C of the point a exists 
so that x{to) G i?a entails that in the limit t -hoo, x ^ a. If this happens 
for t — (X), X = a is said to be a negative attractor. 

Let us assume now that a solution x = x(t) of our system, defined in 
some interval / C M, with values x(t) G D C is such that x{t) — 
x(t -h T), with T > 0. In this case x{t) is said to be a periodic solution with 
period T. It is easy to see that x(t) = x(t + T) implies also that x{t) = 
x(t zb nT), with n any integer. If we consider the phase space of the system 
X = X (x), since for a periodic solution we have that xi,X2, . • . at the 
time to + T^ again assume the values they had at the time to, it follows that a 
periodic solution gives rise to a closed phase curve (or cycle). Making use of 
the translation property of autonomous systems and of the fact that several 
translated solutions correspond to the same phase curve, one can demonstrate 
also the inverse: to a closed phase curve there corresponds a periodic solution. 
As an example, let us consider the system 



Xi = X2, X2 = -Xi. 



(1.A.3) 



The solutions are linear combinations of sint and cost (one solution is ob- 
tained from the other by means of a translation of tt/2) and we have for 
X = (xi,X2) a periodic solution of period T = 2tt. The phase curves are 
obtained by transforming the set (1.A.3) into 



dX2 Xi 

dxi X2 ’ 



(1.A.4) 



as in (1.A.2). Integrating (1.A.4), we have: 

X 1 +X 2 — const.. 



(l.A.5a) 



i.e. the phase curves are concentric circles (Fig. 1.1). From xi = a cost -f 
bsint, X 2 = —a sint + 6 cost, a, 6 G M, it is evident that the critical point 
xi = X2 = 0 is not an attractor. 

If instead we consider the system 



Xi = Xi, X2 = X2, 



(1.A.6) 
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the phase curves will be 



dx2 _ X2 
dx\ x\ ’ 



(1.A.7) 



from which X 2 = o;xi, a G M. The phase curves are straight lines through 
the origin (Fig. 1.2) and x\ = ciexp(f), X 2 = C 2 exp(t) so that the equilibrium 
solution (0, 0) is a negative attractor. The arrows in the figures indicate the 
direction of the motion of the phase point along the curves with increasing 
t. This motion is called the phase flow and the plot of a set of phase curves 
(corresponding to different initial conditions) the phase portrait 

In the two examples proposed, it has been possible to integrate the system 
(1.A.2) and to obtain a relation among the components of x; this relation 
defines an (m — 1) -dimensional manifold, a subset of the m-dimensional phase 
space: it represents the level set of a function of xi, X 2 , . . . , Xm- Such a level 
set, which we can in general indicate with 



F (xi,X 2 , . . . ,Xm) = const., (1.A.8) 

is called an integral manifold and the function F{x) the first integral of the 
system x = X{x). Generally speaking, we say that a real differentiable func- 
tion F{x) is a first integral for the system x = X{x) if it stays constant along 
a solution of the system itself, that is. 



dt 



dF , dF . 

Q ^ ^2 + • • • 

OX I 0X2 



9F . 

o ^rn — 0- 

OXm. 



(1.A.9) 



The existence of a first integral has the result of reducing by one the 
dimensions of the phase space available for the motion of the system. In 
turn, the phase curves will be subsets of this (m — l)-dimensional manifold. 
In the two cases we have seen, being m = 2, the phase curves coincide with 
the family of integral manifolds. 

Since the examples we have considered up to now were extremely sim- 
ple (linear systems with constant coefficients like (1.A.3) and (1.A.6)), the 
method of integration based on lowering the order of the system by means 
of the first integral may appear an unnecessarily cumbersome procedure. 



1.1 Dynamical Systems: Generalities 
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However, it is essential in the more complex cases, so it is worth explaining 
its application. Consider the system (1.A.3), the harmonic oscillator. As we 
know, the solution can also be written a,s Xi = x = Asin(^ + 0), X 2 = x = 
Acos{t + (^), with A = \/a^ -h 6^ , 0 = arctan(a/6). From the first integral 

(l.A.5a), which we rewrite as 





\{xi+xi) = h, 


(l.A.Sb) 


and the first equation of (1.A.3), we get 






II 

H- 

1 


(I.A.IO) 


And, integrating. 




(l.A.lla) 


that is. 


t + /,=arcsm(j^), 


(l.A.llb) 



where I 2 is an integration constant. By inverting (l.A.llb), one finally^ has 



Xi = + I 2 ), (1.A.12) 



which coincides with the above-mentioned solution if we set A = \/27i, 0 = 
/ 2 . The integration constants are evaluated by means of initial conditions; if 
we put xi(0) = xj, xi(0) = X?, we have: 



Summarizing, the procedure we followed consists of four steps: 



(1.A.13) 



J2 = (p = arctan -^77 = arctan — . 

xV b 



a) to find a first integral; 

b) to use the found integral to reduce the order of the system; 

c) to integrate the reduced system (if the original system were of order two 
only a quadrature remains to be performed); 

d) to invert and so to obtain the solution. 



From what we have said, it is obvious that to integrate a system of order m 
explicitly (i.e. to reduce it to quadratures), one needs m — 1 first integrals. 



^ The ambiguity in sign of (I.A.IO) is inherent to the problem: during the oscillatory 
motion between the two values of xi (points of inversion of the motion) given by 
±V2/i, xi changes sign going through zero at the inversion points. The integral 
of (l.A.lla) will be evaluated by taking the -h sign for positive dx\ and the — 
sign for negative dx\. Then the value of the integral increases with increasing t. 
We shall meet the same problem again later on. 
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Up to now we have considered first integrals which were functions only of 
the variables X\^X 2 ^. . . , Xm and not of the time t, according to the definition 
(1.A.8). However, sometimes integrals exist which depend also on time. One 
of the best known cases is given by the system describing a damped oscillatory 
motion: 

Xi — X2, X2 = -Xi - Ax2, (1.A.14) 

with A a positive constant. The first integral is given by: 

e^^{x\ + ^2 + \x 1 X 2 ) = h = const. (1.A.15) 

We are not interested in continuing our discussion of the system (1.A.14), but 
we shall employ the systems (1.A.3) and (1.A.14) to anticipate a classification 
which is of paramount importance: the distinction between “conservative” 
and “non-conservative” systems. If we put x\ — x and X 2 = x\ = x, the two 
systems are equivalent respectively to x = —x and x = —x — Xx. We call a 
system conservative for which 



x = fix) (1.A.16) 

and non- conservative for which 

X — g{x, x). (1.A.17) 

The first case, in mechanics, represents a force deriving from a potential (and 
depends only on the position) and the second, a force depending also on the 
velocity. In the first case we have conservation of the total energy. 



1.2 Classification of Critical Points — Stability 



Let us assume that the system (l.A.l) has a critical point at cc = a; in this 
case, the Taylor expansion in the neighbourhood of a? = a is 



X = X{x,t) = ^{x,t) 



{x — a) + 0(|a; — ap). 



(1.A.18) 



If we limit ourselves to considerering only the first term of the expansion 
(1.A.18), this means that we replace the system (l.A.l) by its linear approxi- 
mation: i.e. we have linearized the system. As we shall see later on, the linear 
approximation is much more meaningful and rich in information about the 
solutions of the complete system than one could expect. In the case in which 
the complete system is autonomous, the square matrix m x m, which we have 
indicated symbolically by {dX /dx)\x=a^ will have constant elements. In this 
case it is convenient to perform a translation so as to shift the critical point 
to the origin. 
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Therefore, without loss of generality, we shall limit ourselves to studying 
an autonomous system with a critical point at the origin. Then the linearized 
system will be 

X = Ax, (1.A.19) 

A being a constant matrix with elements given by 



Aij — 



dXj 

dxj 



x=0 



We shall assume that det A 7^ 0 {A is a non-singular matrix) and look for 
solutions of the system (1.A.19) of the form 

Xl=Q;le^^ X 2 =a 2 e^\ ..., (1.A.20) 



with a\, a2^ ... , otm and A to be determined. By substituting in (1.A.19), 
one gets the set: 



(All — A)ai -h A12OI-2 + . . . + AimOLm — 0 , 

A2lOLi + (A22 — A)a2 + . . . -f A2m0^m = 0 , 

(1.A.21) 



AmlOLl + Am20t2 + . . . + — A)arn = 0, 

which will have solutions different from ai = a2 = . . . ~ am = 0 only in the 
case in which the determinant of the coefficients will be zero: 

det(A- Al) =0, (1.A.22) 

1 being the identity matrix. Equation (1.A.22) is called the characteristic 
equation of the system (1.A.19) and det(A - Al) the characteristic poly- 
nomial. Equation (1.A.22) is an algebraic equation of degree m and will 
have m roots, real or complex, and not all necessarily distinct. The val- 
ues of A, roots of (1.A.22), are the eigenvalues of A. When the m roots 
Ai,A2 , . . . , Xm are distinct, by substituting them into the set (1.A.21), one 
obtains corresponding to each of them an m-tuple 

=oS^\ (1.A.23) 

where the m-tuple corresponding to the eigenvalue Xj is labelled by 
is said to be the eigenvector corresponding to the eigenvalue Xj. Since we 
are dealing with a homogeneous system, one of the components of the vector 
ol^A is arbitrary, i.e. it is defined up to a scalar factor. In the case of distinct 
eigenvalues, one can demonstrate that the m eigenvectors ol^A are linearly 
independent. 

With the vectors ol^A ^ one then constructs the m solutions of the system 
(1.A.19), obtaining 
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X 



X 






(1.A.24) 



Therefore, the general solution of the system (1.A.19) will have the form 

m 

X = (t), (1.A.25) 



where Cj are m arbitrary cost ants. However, in practice, instead of deter- 
mining the vectors from the set (1.A.21), it is preferable to look for the 
general solution of the system (1.A.19) in the form 



j=i 



(1.A.26) 

j = l 



where the \j are the (distinct) roots of the characteristic equation and the 
are determined by substituting the Xj(t) in the system (1.A.19) and 
equating the coefficients of the same exponentials The will depend 
on m arbitrary constants. We shall not deepen the general case (any m) 
further, limiting ourselves to the case m = 2, which allows the use of a 
graphical representation and thus a more immediate understanding. 

In this case the system (1.A.19) will become 



xi = Aiixi + A12X2, 

±2 = ^21^1 + ^22^2, 



(1.A.27) 



and the equation for the phase curves, 

dx 2 _ ^213^1 + ^22^2 
dxi A\\X\ -|- A12X2 



(1.A.28) 



In general (1.A.28), which in any case can always be integrated, is too 
complicated to allow a discussion of the geometric nature of the solution in 
a simple way. It is more convenient to rely on a non-singular linear transfor- 
mation, given by a suitable matrix S , such that 






(1.A.29) 



^ being a new vector. A transformation of this type may consist of a rotation 
and a dilation (or contraction) : curves get deformed but the behaviour in the 
neighbourhood of (0, 0) does not change. 

From (1.A.29), ^ = S~^ x and ^ = S~^x = S~^ Ax . Then 

k = S~^ASi = B^. 



(1.A.30) 
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The matrices A and B are said to be similar and have the same charac- 
teristic polynomial.^ Now we must determine S so as to obtain the simplest 
matrix B as possible. One can show^ that there is a real non-singular matrix 
S such that B is equal to one of the following six matrices: 




where in (a) A 2 < Ai < 0 or 0 < A 2 < Ai, in (b) A 2 < 0 < Ai, 

c) (J “). d) (J J), (1.A.31) 

where A > 0 or A < 0. 




where in (e) u^v ^ 0 and -u > 0 or < 0; in (f) v ^ 0; Ai and A 2 are the 
roots of the characteristic polynomial (Ai = u iv, A 2 = u — iv when they 
are complex conjugate). 

Different phase portraits correspond to the cases listed in (1.A.31). Case 
(a) is pictured in Fig. 1.3, in which the arrows indicate the direction of time 
and correspond to negative roots Ai, A 2 : the origin is called an improper 
stable node.^ If Ai, A 2 > 0, the picture is obtained by inverting the arrows 
and exchanging with ^ 2 - we have an improper unstable node. 

Case (b) corresponds to Fig. 1.4 (A 2 < 0 < Ai) and the origin is called 
a saddle point. For case (c), the picture is given by Fig. 1.5 and we have a 
stable proper node corresponding to A < 0; an example of the case A > 0, 
an unstable proper node, is given by the system (1.A.6) of Sect. 1. Case (d) 
gives a further case of improper node (Fig. 1.6). The node is stable (if A < 0, 
the case pictured in our figure) or unstable (if A > 0). For complex conjugate 
roots, case (e), we have a spiral point: stable u < 0, unstable ii u > 
0. In Fig. 1.7 we have pictured an unstable spiral point. Case (f), u = 0, 
corresponds to imaginary roots and the origin is called the centre] in Fig. 1.8 
we have considered > 0. The case of the linear oscillator with unit frequency 
considered in (1.A.4) is the same but with inverted arrows (it corresponds to 
V < 0). 

Let us continue, now, with the autonomous system 

Xi = Xi{xi,X2, . . . ,Xm), 2 == 1,2, . . . ,m, (1.A.32) 

^ See, for instance, F. R. Gantmacher: The Theory of Matrices (Chelsea, New York, 
1959), Chap. III. 

^ For this, and other assertions given without proof, see F. Bauer, J. A. Nohel: The 
Qualitative Theory of Ordinary Differential Equations. An Introduction (Dover, 
New York, 1989), Chap. 2 and Appendices. 

^ The meaning of the term “stable” will be made clear later on in this section. 





Fig. 1.6 Fig. 1.7 Fig. 1.8 



assuming again the validity of the existence theorem in a suitable m-dimens- 
ional domain. Moreover, let us assume that x = a is a critical point for our 
system. We shall introduce for it the concept of stability according to Lya- 
punov, or L-stability. We say that the critical point, or equilibrium solution, 
Xi(t) = a^ is stable if, for each number £: > 0 , we can find a number (5 > 0 
such that: 

i) any solution of (1.A.32), which belongs to the neighbourhood of a with 
radius 8 for some t = ti, he defined for any ti <t < oo; 

ii) a solution satisfying (i) remains in the neighbourhood of a with radius 
£ for any t > t\. 

If, furthermore, each solution satisfying (i) and (ii) is such that 

iii) the limit for t — > oc of Xi{t) is a^, Vi, then the point a is said to be 
asymptotically stable. 

If we now apply these definitions to the linear system (1.A.27) and to the 
considered phase portraits, we can immediately verify that: 
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1) we have asymptotical stability only when the two roots are negative real 
or complex conjugate with negative real part, 

2) the centre (imaginary roots) is stable but not asymptotically stable, 

3) in all other cases, we have instability. 

In the case of a general non-linear system, as a rule the solutions are 
not available, so we cannot directly study their behaviour as for the linear 
system with constant coefficients (1.A.27). However, Lyapunov has shown 
that, in most cases, we can get knowledge of the nature of a critical point of 
a general system (1.A.32) merely by referring to the system obtained from it 
by linearization. More precisely, 

1) if all the roots of the characteristic equation pertinent to the linear 
system (1.A.19) have a negative real part, then the equilibrium solution 
Xi = 0 of the system (1.A.32) is asymptotically stable, as if the system 
consisted only of the linearized part; 

2) if among the roots of the characteristic equation for the linear system 
there is at least one A with a positive real part, then the solution Xi = 0 
of the system (1.A.32) is unstable, whatever its non-linear part may be; 

3) if the characteristic equation of the linear system has no roots with a 
positive real part, but has some imaginary or null root, then we have the 
so-called critical cases and the stability (or instability) will depend on 
the non-linear part of the system (1.A.32). Needless to say, if the system 
(1.A.32) had a critical point with coordinates ^ 0 , one performs a suitable 
translation of the coordinate axes. 

Before closing, we wish to give, concisely, some definitions to outline how 
the concepts of dynamical systems can be set in abstract form. 

First of all, the phase space will be, in general, a differential manifold^ 
{M.) . We have already associated the phase flow to the motion of phase points 
along phase curves. More formally, we can now introduce the pair {M, {^t}) 
given by the manifold M and by a one-parameter group of transformations 
of M onto itself. The generic group element will be given by a transformation 
depending on the parameter t : 

x,^txeM (VteM). (1.A.33) 

For the group properties hold: 



^0 = identity transformation, (1.A.34) 

= inverse trasformation, 

where o means composition of the two transformations. 



See, for instance, B. F. Schutz: Geometrical Methods of Mathematical Physics 
(Cambridge University Press, Cambridge, 1980). 
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If, given a? G A^, we consider the mapping 

^ : R -> Ad, (1.A.35) 

we shall call ^ the motion of x due to the phase flow (Ad, The phase 

curve will be the image of the mapping ^ : R Ad, i.e. a subset of the phase 
space Ad. Moreover, a critical or equilibrium point of a flow (Ad, {^t}) will 
be a point x G Ad which is, itself, a phase curve: 

— X, (Vt G R). 

Finally, recalling that a diffeomorphism is a mapping / : Ad -> AA that is 
bijective and differentiable together with f~^ : AA — Ad, we say that {^t} is 
a one-parameter group of diffeomorphisms of Ad. In fact {^t} is a mapping 

^ : R X Ad — )► M, ^ {t, x) — ^tx, (t G R, X G Ad) 

such that: 

a) ^ is a differentiable mapping 

b) VtGR, Ad^ Ad 

c) the family {^t, t G R} is a one-parameter group of transformations of 

Ad. 



1.3 The n-Dimensional Oscillator 

As we have said, the system (1.A.3) represents the harmonic oscillator of 
unit frequency; if we consider a frequency cJi 1, the Newtonian equation 
of motion will be 

xi = —uJiXi. (l.A.36a) 

If we put, differently from before, Xi = uJiVi, the corresponding system^ will 
be 

xi=ujivi^ vi = —ouiXi, (l.A.36b) 

and the phase space will be the XiVi plane. 

Let us now consider in the Cartesian X\X 2 plane a second linear oscillator 
whose equation is 

X 2 — —^2 X 2 , (l.A.37a) 

and the corresponding system 

X2 — UJ2 V2, V2 = -^2 ^2- (l.A.37b) 

By the composition of the two motions given by (l.A.36a) and (l.A.37a) we 
obtain a new physical system: a planar oscillator. This can be thought as 
consisting of a point of unit mass subjected to two orthogonal oscillations. 
The corresponding dynamical system is 



^ Obviously, v\ is proportional to the velocity, but is not a “true” velocity from the 
dimensional point of view. 
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xi =u;ivu vi = 

X2 = OJ 2 V2, V2 = -^2 X2, 



(1.A.38) 



the phase space is 4-dimensional and the phase curves are subsets of this 
4-dimensional space; their projections on the X 1 X 2 plane give the trajectories 
(orbits) of the moving point. Different orbits may intersect, whereas the phase 
curves, as we know, may not. As in (l.A.5a), the first integral can be found 
immediately (corresponding to the conservation of the total energy of the 
planar oscillator): 



/ = 




+ x?) 




(V2 + X 2 ) = E. 



(1.A.39) 



The 3-dimensional surface defined by (1.A.39) is the energy integral manifold: 
it is a subset of the 4-dimensional phase space (R^) and is invariant under 
the phase flow. On the other hand, having obtained the system (1.A.38) by 
putting together the two independent systems (l.A.36b) and (l.A.37b), one 
finds that the solutions, a motion in R^, will be 



xi = Ai sin(LJit + (pi), 
X2 = A 2 sin(cc;2t -h (^ 2 )^ 
vi = Ai cos{uJit -h (pi)^ 
V2 = A 2 cos(u;2^ + (P2)^ 



(1.A.40) 



and the space R^ results from the direct sum of two invariant planes: R^ = 
+R ^2 The phase curves are given by the circles on these two planes 

= {(a;i, vi) e xj + vl = Al}, 

= {(2:2, V2) e xl + vj = Al}, 

and by the points given by Ai = 0, A 2 = 0. 

The phase flow consists of rotations of angles c^it, (jj 2 t. Every phase curve 
of the system (1.A.38) belongs to the direct product of the phase curves in 
the planes R^ 2 ,V 2 * The direct product of two circles will be 

= S'! X = {{xi,vi,X2,V2) e R^ : X? = Aj.xl + = A 2 }, 

i.e a two-dimensional torus. As is known, may be imagined to be ob- 
tained by making a circle rotate around a coplanar axis which lies outside 
the circle itself (Fig. 1.9). A point on the surface of the torus corresponds 
to two angular coordinates cpi, (p 2 (mod. 27 t), which may be called longitude 
and latitude^ respectively. The coordinates (^ 1 , (p 2 define a diffeomorphism 
between the surface of the torus and the direct product of two circles (T^). 
One can also imagine “building” the two-dimensional torus by starting from 
a rectangle and “gluing” first two opposite sides, obtaining a cylinder, and 
then, by suitable deformations, the remaining two sides (already transformed 
into circles). Therefore, in our case, the square 0 < (pi < 27t, 0 < (/92 < 27 t in 
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the (pi(f 2 plane can be considered a map of the torus T^. The pair of points 
((/?i,0), ((/^i,27t) and (0,(^2), (27t,(/?2) are identified in 

The fiow of the system (1.A.38) leaves the torus C invariant. The 
phase curves of (1.A.38) lie on the surface T^. Now, if (fi is the polar angle 
in the plane (considered positive in going from the positive v\ axis to 

xi) and the same happens for ip 2 in the X 2 V 2 plane, from (1.A.40) we have 



= Ul2t + (fi2- 



(mod. 27 t) 



(l.A.41a) 



The two (l.A.41a) are the equations of the phase curves. On the surface of 
they represent a helix, whereas in the “map”, that is, in the (pi(f 2 plane, 
we have the straight lines 

(7^2 = — + I ^2 ~ ^ ] (mod. 2tt). (l.A.41b) 

LUi \ J 



Now the following proposition holds: 

If ui and (jj 2 are rationally dependent,^ every phase curve (l.A.41a) on the 
torus is closed; if, on the contrary, uoi and UJ 2 are rationally independent, 
every phase curve is everywhere dense^ on the torus T^. 

To the proof we shall premise the following lemma: 

If the circle is made to rotate on itself by an angle a not commensurable 

with 27 t, the images of each point due to repeated application of the 
rotation a (mod. 27 t) constitute a set everywhere dense on the circle. 

To demonstrate this, let us split up the circle into k arcs corresponding to 
angles of amplitude 27 t//c. If we perform /c + 1 applications of the rotation a, 
because of the well-known principle of the Dirichlet cells we shall have two 
images of the same point of the circle in one arc, these images being different 
since a/27T is not rational. Let us assume that these two images correspond 
to the angles (p-\-pa and p-\-qa {cp is the initial reference angle), with p > q, 
to fix the ideas. Let us put 5 — p — q\ then the angle sa differs from a 
multiple of 27 t by an amount smaller than 27r/fc. Therefore, if we consider the 
sequence p), p) sa, p> -\- 2sa, p -h 35a, . . . (mod. 27 t), any two consecutive 
terms differ by the same quantity, which is smaller than 27 t/A:. Then, if we 
fix an arbitrary £ > 0, a /c will always exist such that 271 /k < e, and, as a 
consequence, in every neighbourhood of a point of the circle of amplitude s, 
there will always be terms of the sequence (p -h Nsa (mod. 27 t). Of course, 
what has been demonstrated above holds since a is not commensurable with 

^ Two numbers and uj2 are rationally dependent if two integers k\,k2 7^ 0 exist 
such that kiUJi -f- k2002 — 0, and rationally independent if this happens only for 
ki — k2 = 0 . 

^ A set A is everywhere dense in B if, in an arbitrary small neighbourhood of any 
point of B, there is at least a point of A. 
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27t; otherwise an integer n would exist such that a rotation by an angle na 
should be equal to an integer number m of rotations of 27 t, and therefore 
the image of every point would be superimposed on the “preceding” images 
every m complete turns. 

Let us go back, now, to the proposition concerning the phase curves. 



a) 



Let (jJi and L 02 be rationally dependent, i.e. kiCJi + k 2 i 02 — 0 with ki and 
/c 2 non-zero integers. Then it is sufficient to define the period T as given 
by 



T = 








One immediately checks that this makes the curve (l.A.41a) closed on 
the torus. 

b) Now, let LOi and lu 2 be rationally independent, i.e. uj 2 l^i is irrational. By 
referring to the map of in the (pi(f 2 plane, let us consider the sequence 
given by the points in which the straight lines (l.A.41b) intersect the axis 
(p 2 (mod. 27 t). These points will have as coordinate (p 2 



n ^2 

+ 27t/c 

UJl 



(mod. 27 t). 



From the above lemma, it follows that the set given by these points is 
everywhere dense in the segment 0 < (/?2 < 27t. As a consequence, the set of 
the segments belonging to the above straight lines is everywhere dense in the 
square 0 < (^1 < 27t, 0 < (f 2 < and moreover the corresponding phase 
curve is everywhere dense on T^. 





Note that is possible to show that the above case can be generalized to n 
independent harmonic oscillations. Therefore one can demonstrate that the 
corresponding phase curves lie on an n-dimensional torus 

X X X ' - = {{(fi, (p2, • • • , mod. 2tt] 



n times 
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and satisfy the n independent equations 



(p — LUl, P>2 — ^ 2 , • • • 5 — ^n- 

Moreover, if the frequencies cji , u;2 , • • • , are rationally dependent, every 
phase curve is closed; in the opposite case every phase curve is everywhere 
dense on the torus T^. For the planar oscillator one can give a graphical 
representation by exploiting the energy integral. The two oscillatory motions 
being independent, besides ( 1 .A. 39 ) we also have 

Ii = [vf + xl) = El — const. 

J ( 1 .A. 42 ) 

h’l + ^2) = £'2 = const. 

From ( 1 .A. 42 ), since kinetic energies must always be greater than 0 , we have 



|a;i| < 






\X2\ < 




Thence, the orbit of the point submitted to the two orthogonal oscillations 
is confined in the rectangle [xil < Ai, |a;2| < A2. For the same reason, from 
( 1 .A. 39 ), we have 

+ uj^xl < 2 E = 2 /, 

i.e. the motion in the X\X2 plane is confined to the ellipse 



^2E 






2 E 



= 1, 



with El ^ E2 — E (see Fig. 1 . 10 ). 

It is easy to check that the rectangle turns out to be inscribed within the 
ellipse. We have seen in ( 1 .A. 40 ) that 

xi — Ai sin {(jJit -h 
X2 = A2 sin {ijj2t A- P2)' 



Of course, the determination of the actual orbit must be done by composing 
the two independent oscillations of amplitude Ai and A2 and frequencies uoi 
and 0J2 respectively. The task is not simple and depends, obviously, on the 
values of the ratio uJiluo2 and of the phase shift 9^2 ~ practice, one can 

use the following strategy. 

A sinusoid with period 2 t^ Ai{uJi / UJ2) and amplitude A2 is drawn on a 
strip of width 2A2 (see Fig. 1 . 11 ). The strip is wrapped round a cylinder of 
height 2A2 and diameter 2 Ai. The orbit in the X1X2 plane is obtained by 
projecting the sinusoid on this plane. The various orbits are called Lissajous 
figures. One can show, in the same way as for the phase curves, that, if 001/002 
is a rational number, the trajectory is a closed algebraic curve and, if 001/002 is 
irrational, the trajectory fills the rectangle such that it is everywhere dense. 
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Let us examine the simplest cases with rational 

1) uj\ — UJ 2 (isotropic oscillator): The curve on the cylinder is an ellipse. 

The projection of the ellipse on the X\X 2 plane depends on the choice 
made for the directrix of the cylinder (which is projected in the axis X 2 ), 
i.e. on the phase shift ^2 ~ ^2 — ^15 we obtain the diagonal 

of the rectangle; for small values of ^2 ~ ellipse squeezed on the 

diagonal; for (/?2 — = tt/ 2 , the ellipse with principal axes on x \ and X2, 

etc. (Fig. 1.12). 

2) lj 2 = 2o;i: See Fig. 1.13, in which Lp 2 ~ ^1 ranges from — tt/ 2 to 0. 

3) 2lu2 = 3u;i: In Fig. 1.14, (^2 ~ ranges from — tt/ 4 to 0. Of course, 
the diagonal in case (1) and the parabola in case (2) (the first curve in 
Fig. 1.13) are closed curves, i.e. described twice. 
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B. Lagrangian Dynamics 
1.4 Lagrange’s Equations 



Given a physical system consisting of N mass points, its configuration will 
be described at any time by means of 3N coordinates, for instance Cartesian 
coordinates (i = 1, 2, . . . , 3N)^ with the convention that 



ri = (xi, X2, X 3 ), 
r 2 = (x4, X 5 , xe). 



Vn — {xsN-2, XSN-I, Xsat); 

rj, (j = 1, 2, . . . , A^), being the distance vector of the generic point. In gen- 
eral, among the 3N coordinates, a certain number p, with p < 3iV, of kine- 
matical conditions (constraints) may exist; when they are finite relations, the 
system is called holonomic; when they are not integrable differential relations, 
the system is non-holonomic. In view of the applications in which we shall be 
interested, we limit ourselves to considering only holonomic systems, even if 
the results of this section can be extended to the non-holonomic^^ ones. 

The relations among coordinates will be of the type 

/m(a:i,X2 ,... ,X 3iv;t) = 0, m = 1,2, ... ,p. (l.B.l) 

In (l.B.l) the case in which the relations among coordinates may depend 
explicitly on time is also taken into account. When p relations like (l.B.l) 
hold, obviously the system will actually depend on only n — 3N — p coordi- 
nates; usually one expresses the xi in terms of n parameters, or generalized 
coordinates, or Lagrangian coordinates, which are traditionally indicated by 
grfc(A: = 1,2, ... ,n): 

Xi Xi (^1, Q 2 ’! • • • 5 Qm 0 ’ ^ f 5 ^5 • • • 5 3N. (I.B.2) 

The number n of parameters necessary (and sufficient) to completely charac- 
terize the configuration of the system is the number of “degrees of freedom” 
of the system itself. The time derivatives of the written as qk^ will be the 
generalized velocities; it must be kept in mind that, in general, they are not 
velocities in the strict sense. As, at any time, the configuration of the system 
is specified by the n-tuple (^1, ^2, • • • , 9n), one can make the evolution of the 
system correspond to the motion of a point of coordinates (g'l, ^2? • • • , ^n) , 
in an rz-dimensional space Q {configuration space). 



^°See H. Goldstein: Classical Mechanics, 2nd ed. (Addison- Wesley, 1980), p. 45. 
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When the system consists of N mass points, the total kinetic energy will 



be 



^ N 3N 



rrik 



dxk 

dt 



i=l k=l 

thus defining, in a natural way, the metricd^ 

3N 

ds^ = 2T dt^ = ^ {y/m^dxkf 

k=l 



(1.B.3) 



In the case of N free mass points (i.e. without constraints), we can take 
the y/rrik Xk as rectangular Cartesian coordinates, and therefore the SN- 
dimensional configuration space has a Euclidean structure: any transforma- 
tion to curvilinear coordinates cannot change its real nature. The squared 
line element would take the form 



3N 

ds^ ^ gikdqidqk, (1.B.4) 

i,k=l 



with a generic Riemannian appearance, but the geometry would remain Eu- 
clidean. If, on the other hand, p relations of the type (l.B.l) exist, things 
change substantially. Geometrically, each kinematical condition represents 
an hypersurface in 3A^-dimensional configuration space. The intersection of 
all hypersurfaces determines a {3N — p)-dimensional subspace in which the 
representative point is constrained to remain. This subspace is no longer Eu- 
clidean, but in general Riemannian. 

One can also make the choice of expressing xi, X2, ... , xsn as functions 
of n = 3N — p independent parameters ^ Qn from the start and 

then to replace their differentials dxi, dx 2 , . . . , dxsN in the expression of 
ds"^. In this case one gets 



ds — ^ ^ f^ik dQi (I.B.5) 

i,k=l 

where the coefficients aik are functions of the g’s. Now the line element has 
a true Riemannian structure, not only because the g’s are curvilinear coordi- 
nates, but because the geometry of the (n-dimensional) configuration space 
does not retain the Euclidean structure of the original 3Wdimensional space. 
In the further case of non-holonomic constraints, one cannot proceed in this 
way and must therefore consider the 3N'-dimensional space with the addition 
of the constraints. In this case, we no longer have a definite subspace of the 
3N'-dimensional space, because the kinematical conditions prescribe certain 
pencils of directions, but these directions do not envelope any surface. 



^^See (A. 6) in the Appendix and the following discussion. 
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If we return now, to Cartesian coordinates and consider a system of N 
mass points in which forces can be derivable from a work function (or po- 
tential) C/(xi, X 2 , . . . , xsn) = U{x)^ the kinetic energy is still expressed by 
the second equation of (1.B.3) and we shall call the function 

C = C{xi,X 2 ,... ,X3N', Xi,± 2 ,... ,X3n) = T + U. (1.B.6) 

the kinetic potential, or Lagrangian function, or just Lagrangian for short. 

In the case proposed above as U is not dependent on time, the system is 
conservative and we also have 



U = -V, C = T-V, 



(1.B.7) 



where V is the potential energy. In the most general case, we may meet a 
potential function depending on time and thence a Lagrangian depending ex- 
plicitly on time. Therefore, we shall often write C — C {x, x, t) to emphasize 
that the results obtained are valid also in the most general case. It can now 
be seen immediately that the Newtonian equations of motion can be written 



dC d dC 

dxi dt dxi 



(i = l,2,... ,3AT). 



In fact (l.B.Sa), with the substitution of (1.B.6), gives 



rUiX^ 



dxi' 



(i-1,2,... ,3iV). 



(l.B.Sa) 



(l.B.Sb) 



Equations (l.B.Sa), universally known as Lagrange’s equations, contain more 
than may appear at first sight: i.e. they are not merely another form in which, 
in the case considered, Newtonian equations of motion can be written. To 
further highlight the general significance of Lagrange’s equations, let us now 
show how they can be deduced from a variational principle, Hamilton's prin- 
ciple. This is an integral principle which describes the motion of those me- 
chanical systems in which all forces (except those deriving from constraints) 
can be derived from a single generalized potential function depending on co- 
ordinates, velocities and time. Let us assume, therefore, that for our system 
(l.B.l) and (1.B.2) hold, i.e. the system has n degrees of freedom and his 
configuration is described by n generalized coordinates qi, , 9n- The 

principle can be stated as follows: the motion of the system in the arbitrary 
time interval ^ 1,^2 is such as to make the action integral 



1 = C{qi,q 2 ,... ,q„] qi,q 2 ,---qn, t)dt (1.B.9) 

Jti 

stationary for independent variations of the g/^’s, which we indicate by 6 qk^ 
with only the prescribed condition that they vanish at the times ti and ^ 2 - 
That is. 



61 = 0, with Sqk\t=ti = 6 qk\t=t 2 = 0> Vfc. 



(I.B.IO) 
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By applying the standard methods of the calculus of variations, (1.B.9) and 
(I.B.IO) immediately give 



dC d dC 

dxk dt dxk 



{k= 1,2,... ,n). 



(l.B.ll) 



An elementary check can be made in the case of only one degree of freedom: 
C = t) and then 

^ = f C.{q,q,t)dt. 

Jti 

One has to determine a curve q — q{t) such that the variation of the integral 
/, when passing from evaluating it along the curve to evaluating the same 
integral along any neighbouring curve, vanishes. The curve is assumed to join 
two fixed points (^^1,^1), (^25^2)* the integral is evaluated between these two 
points. Therefore one must look for the necessary and sufficient condition in 
order that the integral I be stationary: 



pt2 pt 

; / C{q, q, t)dt = / 

Jti Jti 

-i 

-L 



dq 



t 2 



t 2 r 



dC dC J 
dC^ dCd,^^ 
d (dc , 



dt 



dt 



Sq 



^2 pt 2 


'dC 


d 


^d£V 


+ / 

ti dti 


dq 


dt^ 





eir. 

6 qdt. 



dt 



The first term in 
fixed, and thence 
the curve. 



the last row is zero, since the end points of the curve are 
= 0 for t = ^ 1 ,^ 2 - Moreover, as 5 q is arbitrary along 



J/ = 0 



dq dt dq 



The procedure is then extended to the case of n degrees of freedom. What 
we have seen assures us, therefore, that Hamilton’s principle is a sufficient 
condition for obtaining the equations of motion; one can also show that it is 
a necessary condition, by deriving Hamilton’s principle from the Lagrange’s 
equations. 

In deriving Lagrange’s equations from Hamilton’s principle, a point of 
fundamental importance is raised: the non-uniqueness of the Lagrangian of 
a system. In fact, if in (l.B.ll) we replace C by £ + F, F being the time 
derivative of a function F = F{qi,q2, ... , gn; - - An] t), the content of 

the equations of motion does not change because the definite integral relates 



^^See, for instance, E. T. Whittaker: A Treatise on the Analytical Dynamics of Par- 
ticles and Rigid Bodies (Cambridge University Press, 1937) p. 245 ff. 
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to the values of F at the end points where the g’s do not vary. Conversely, 
one can show that, if C and C' are two Lagrangians which give the same 
equations of motion, it must hold that^^ 

c' ^cff. 



Let us now show one of the fundamental features of Lagrange’s equations, 
that is their invariance in form under any transformation of generalized coor- 
dinates (and therefore also of reference systems). In fact, let us assume that 
our system has n degrees of freedom and is then described by n generalized 
coordinates. Let them be ^1,925- •• and (l.B.ll) the equations of mo- 
tion. If we define a transformation (endowed with all the properties required 
by later developments) to n new generalized coordinates , ^2 , • • • , 2 /n , 



qi = qi{yi,y2,--- ,yn), i = i, 2 ,... ,n, (1.B.12) 

we shall have for the Lagrangian of the system 

qi{yj,yj), t] ^C{yi,yi,t). (1.B.13) 

Moreover _ 

dyk dq^dyk dqidyk ^ 

where, for the sake of brevity, sum over repeated indices is implied. By in- 
verting the order of derivation in the second term of the right-hand side, we 
get 

dC ^ I ^ (k - I 2 n) 

dyk dqi dyk dqi dt dyk 

Since qi does not depend on the y/e’s, we also have 



dC _ dC dqi 
dyk dqi dijk 



where it must be remembered that 







and then 

oqi _ ^ 
dyk dyk 

{yk being one particular among the y^s). We can then collect together the 
results obtained in the form 



^^See, for instance, S. N. Rasband: Dynamics (Krieger, 1991) p. 56. 
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dC d dC dC dqi dC d / dqi \ d / dC dqi \ 

dyk dt dijk dqi dyk dqi dt \dyk ) dt \dqi dyt J 

_ dC d dC dqi 

dqi dt dqi\ dyk 

where the sum over i and /c = l,2,...,n is implied. 

Since, for (l.B.ll), every term in the square bracket vanishes, 



dC d dC 

dyk dt dyk 



(A: = 1,2, ... ,n). 



Therefore, (l.B.ll) have a much more general significance than would appear 
from the form (l.B.Sa): Lagrange’s equations are invariant with respect to 
an arbitrary coordinate transformation. Afterwards C {q^ q, t) will always be 
assumed to be C^, i.e. a function with continuous derivatives up to the second 
order in all its 2 n + 1 arguments. 

The system (l.B.ll) can also be written by developing the time derivative: 



dC d^C . d^C .. d^C 

o ^ - o. o. (Ik - =0 (^ = 1, 2, . . . , n); 

dqi dqidqk dqidqk dqidt 

therefore, so that the system can be written in normal form for the q’’s as 
unknown functions, the (Hessian) determinant given by the derivates 



d^^C 

dqidqk 



must always be different from zero. We assume that this is always true. 



1.5 Ignorable Variables 

and Integration of Lagrange’s Equations 



Lagrange’s equations (l.B.ll) are a system of n second-order differential 
equations in the unknown functions qk{t)^ and therefore, like the Newtonian 
equations, they correspond to a differential system of order 2n. Whereas gen- 
eral methods of integration of Lagrange’s equations do not exist, it may be, 
however, that the structure of the Lagrangian allows the existence of con- 
stants of the motion and thus the possibility of lowering the order of the 
system. This happens, for example, when the Lagrangian does not contain 
one (or more) of the g/c’s, though containing the corresponding generalized 
velocities qk- In the case in which C does not depend on qk^ dC/dqk — 0, 
and then, from (l.B.ll), 



dqk 



= Pk = const. 



(1.B.14) 
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The quantity defined in (1.B.14), is called the momentum conjugate to 
the generalized coordinate qk- We have already pointed out that the in 
general, do not have the dimensions of velocities; analogously, the pkS may 
not have the dimensions of momenta. When C does not depend on one 
says that qk is an ignorahle^ or cyclic^ variable. Therefore (1.B.14) states that, 
when qk is an ignorable variable, the corresponding conjugate momentum is a 
constant of the motion. Of course, it may happen that the Lagrangian turns 
out not to depend on one or more coordinates when a particular type of 
coordinate has been chosen; as an example, if we write the Lagrangian of a 
particle in a central field of force, there are no ignorable variables if we employ 
Cartesian coordinates, while the thing is evident in polar coordinates. This 
example suggests the existence of a connection between the absence of certain 
variables and the symmetries of the problem: the most convenient choice of 
the variables is that refiecting the symmetries of the physical problem. 

Let us now see how one can proceed in the case in which /c (0 < /c < n) 
ignorable variables exist. Let us assume, for convenience, that they are the 
first k coordinates; if not, one could always rearrange things in such a way 
as to return to this case. Therefore we have 

= z = (1.B.15) 

dqi 

where the q’s are constants whose values depend only on the initial condi- 
tions. It can be verified that, in any case, (1.B.15) are linear in gi, , Qk 

and therefore the first k Lagrangian velocities can be expressed as functions 
of Cl, C2, . . • , Ck; , 9n; • • • , that are linear with 

respect to the k c^’s and the n — k q's: 

Qi — Qi{ci,C2, ... ,Cfc; • • • .Qn, Qk-hl^Qk-\-2, • • • ,9n)- (1.B.16) 

Substituting (1.B.16) in the Lagrangian, one obviously obtains a function 
which depends only on n — k coordinates and n — k generalized velocities. We 
have, at this point, n — k oi the old equations which contain only the n — k 
non-ignorable variables and corresponding velocities (besides, of course, the k 
constants Ci) and the k equations (1.B.16). If it were possible to integrate the 
n — k second-order differential equations and obtain the qk-\-i,qk-\-2, • • • , as 
functions of t, of the c^’s and of the initial conditions, it would be sufficient 
to put them into (1.B.16) and obtain the gi, ^ 2 , • • • ,Qk through quadrature. 

Once it was taken for granted that, in general, the integration of the 
n — k differential equations is not possible, it was found more convenient 
(and meaningful) to proceed in a slightly different manner. One transforms 
the reduced system (of the n-k differential equations) in a true Lagrangian 
system corresponding to n - A: degrees of freedom. This system, and (1.B.15) 
rewritten in a suitable form, completely characterize the problem. To do this, 
one defines the reduced Lagrangian (Routh function): 
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^ pi P /c 

2=1 

which, by substituting (1.B.16) into it, becomes 1Z = 1Z{qr; qr\ Ci, t) with 
i = 1, 2, . . . , A: and r = /c + l,A; + 2, ... ,n. Let us show now that this function 
really represents the Lagrangian of a new system with n—k degrees of freedom 
with coordinates qkpi^qk+ 2 -, • • • he. the following system holds: 

dU d dn ^ , 

r = A: + l,A: + 2,... ,n. 1.B.18 

oqr dt oqr 

In fact, if we vary IZ starting from the definition (1.B.17), i.e. varying also 
with respect to the q’s, we get 



Pi P P) P ^ Pi P ^ 

S S'’’’' ^ S s*"' “ S 



r=/c+l 



dqr 



(1.B.19) 



r=l 



From (1.B.19), we then have 



on _ dC 

dqr dqr ’ 



and 



dn _ dC 

dqr dqr ’ 



V — /c-l-l,A:-f-2,... ^Ti 




i = 1, 2, . . . ,k. 



(1.B.20) 

(1.B.21) 



From (1.B.20), owing to the Lagrange equations for ^fc+ 2 , • • • we 
immediately have (1.B.18). Therefore the initial system with n degrees of 
freedom and k ignorable coordinates is transformed into a system with n — k 
degrees of freedom and the Lagrangian 1Z with the addition of equations 
(1.B.21) to determine, through quadrature, the k ignorable coordinates as 
functions of the time and the initial conditions. Let us see, in the case of only 
one ignorable coordinate, some interesting features shown by the reduced 
system. Before doing this, let us consider which structures are possible for 
the total kinetic energy of the system. If our system is subjected to holonomic 
constraints depending also on the time, from (1.B.2) we have 



I OiV 

■^'^rn.iX^ =To{q)+Ti{q,q) + T2{q,q), 

1=1 

where To is dependent only on the generalized coordinates, 7i is linear in the 
generalized velocities and T 2 is quadratic in the g’s. On the other hand, when 
Xi = Xi{qi,q 2 , . . . ,qn)^ ^ = 1,2,... , 3A^, that is, the holonomic constraints 
do not depend on time. 
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3N 

'^rriixf = T2{q,q,) 

i=l 



1 

2 



n 

Z,m=l 



with 



/ N. V ^ dxi oxi 
(^Im ~ ^lm\Q) — / ^mh 

^ dqi dqm 
2=1 



Systems of the last type are called natural systems; as a consequence, the 
Lagrangian of the natural systems will be of the form 



1 

= - ^ aiumk + u. 



(1.B.22) 



i,k—l 



Let us consider now a natural system with only one ignorable coordinate, 
e.g., qi. Then 

1 "" 1 "" 

T = CLuqiqi + ^ CiikqiQk 



2 = 2 



i,k=2 



and 



dr 

dqi 



n 

ciiiqi + ^^12^2 — 

2 = 2 



The Routh function, therefore, will become 



IZ = jC — ciqi 



:E 

i,k=2 



aikqiqk 



1 

2 



+ u. 



(l.B.23a) 



Now, the velocity qi must be replaced in 1Z with (ci - J27=2 This 

will give 



r. — ^ ^ OjikqiQk 

2,fc = 2 



i (Er=2^n42f 

2 ail 




i-^ + C/. (l.B.23b) 

2 ail 



Then the Routh function (the reduced Lagrangian) contains a part that is 
quadratic in the velocities, n — 1 terms linear in the velocities and a terni not 
depending on the velocities. We can consider the last term as an apparent 
potential function and combine it with U to give 

Ceff = C-i^. (1.B.24) 

2 an 

As to the terms that are linear in the velocities, they are called gyroscopic 
terms and are responsible for the fact that the new system with n - 1 degrees 
of freedom is no longer a natural system. 

Let us go back again to the result shown at the beginning of this section, 
i.e. to the fact that if a coordinate qi is ignorable, then the corresponding 
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conjugate momentum is a constant of the motion. We have already recalled 
the example of a particle moving in a central potential U — U{r). After 
having proved that the motion is planar, one can choose polar coordinates in 
the plane of motion: (r, 'd). The Lagrangian of a unit mass is then 

£= i(r2+r^,?2^+C/(r). (1.B.25) 

The coordinate i? is ignorable and, consequently, the conjugate momentum 
(angular momentum) 

= r^'d (LB. 26) 

is a constant of the motion. Obviously, a lot of examples of this type, well 
known in classical mechanics, can be given; however, there is a case which, 
while giving rise to a first integral of paramount importance, seems not to be 
included in the above description. Let us consider, in fact, a system described 
by a Lagrangian not explicitly containing the time 

c. = C{qi,q2,... ,qn',qi,q2,--- An)- (1.B.27) 



Thus 

which can be rewritten. 



dC dC . dC .. 

dt dqi dqi 

owing to (l.B.ll), as 



dt 



dC 



dq, 



-qi 



= 0, 



which is equivalent to saying 

-^qi - C= Pi4i - C = 'H = const. (1.B.28) 

^qi 

It is easy to check that, for a natural system, piqi (where the sum over i is 
implied) equals two times the kinetic energy, and therefore 



K = E = (total energy) = const. (1.B.29) 

The function H = piqt — C introduced in (1.B.28) is called Hamilton’s func- 
tion, or the Hamiltonian of the system; we refer the reader to Part C for 
all related matter; here we limit ourselves to pointing out the coincidence 
of H with the total energy for a natural system and the fact that the ab- 
sence of t from the Lagrangian entails the existence of a constant of the 
motion (the Hamiltonian). However, according to the definition given above, 
t is not an ignorable coordinate, being not a coordinate but the independent 
variable. We can circumvent this lack of symmetry between the coordinates 
qi {i = 1,2, .. . ,n) and the time t, by considering t = go as a coordinate 
and all the n + 1 coordinates defined in this way as functions of a suitable 
parameter w. In this case the action integral (1.B.12) will become 
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I = 




,qn;qi,q2,--- An)dt 



• 5 Qn 



Si 

t' 







(1.B.30) 



where the prime means derivation with respect to the parameter w. The 
Lagrangian is given now by Ct' and does not contain the coordinate t = qo 
(it contains, instead, Qq = t'). The conjugate momentum po = Pt will therefore 
be 



Po 



dt' ~ ^ dqit'^~ dqiV 



r 

^ C - —qi. 



Therefore, po = Pt — —T~i. In this new formulation, one replaces the configu- 
ration space Q by a new space with n + 1 dimensions, the space of events^^ 
defined as R x Q where the coordinates are t{— ^ 2 , • • • , ^n- At this 

point, having already said that the existence or otherwise of ignorable coor- 
dinates is in correspondence with the generalized coordinates one has chosen, 
it is proper to ask oneself if it is possible to give a rule for: (1) deciding if a 
system, through a choice of suitable coordinates, admits ignorable variables; 
(2) determining such coordinates. Unfortunately, neither of these points can, 
in general, be satisfied. It is only possible^^ to decide if a system admits n — 1 
ignorable variables, but not to determine the maximum number of ignorable 
variables which a given system admits. The fundamental problem remains of 
determining the appropriate coordinate system. We can introduce the prob- 
lem in this way. If in the Lagrangian C{q, q, t) no one of the Qi's is ignorable, 
the obvious consequence is that no generalized momentum is constant. How- 
ever, a transformation to a new system of generalized coordinates may exist, 
which we shall denote by such that C{Q,Q,t) (we go on 

using the same symbol C for the sake of simplicity) does not contain one of 
the Qi’s. Let Qi be this coordinate. Then 



Pi 



dc 

— 7- — Cl = const. 

dQi 



In terms of the old coordinates 



dc 






dQi 



dC d<jk 
dqk dQi ’ 



(1.B.31) 



(1.B.32) 



since C depends on the Q’s only through the g’s. Moreover, 

. _ dqk ^ , dqk 
dQi dt' 



'*See J. L. Synge: Classical Dynamics, Handbuch der Physik, III, 1 (Springer, 1960), 



p. 105 ff. 

'^^See J. L. Synge: On the geometry of the dynamics, Philos. Transactions (A) 226, 
31-106 (1927). 
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and then dqkjdQi = dqk/dQi. We have already used this property of “ef- 
facement of the point” in obtaining (1.B.13). We therefore get 



dC dqk 
dqk dQi 



= Cl = const. 



(1.B.33) 



The problem is that of finding a coordinate transformation 



qi = qi{Qi,Q2,--- ,Qn,t), i = l,2,...,n (l.B.34) 

such to satisfy (1.B.33). A transformation like (l.B.34) is called a point trans- 
formation. Rather than devoting oneself to pursuing single transformations, 
it is more fruitful to study families of transformations, and, as we shall see in 
the next section, to consider more general transformations which also involve 
generalized velocities. 



1.6 Noether’s Theorem 

The connection between the existence of conserved quantities and the be- 
haviour of a system under transformations of the variables has been stud- 
ied in a completely general way and the result is contained in a celebrated 
theorem enunciated by Emmy Noether in 1918.^® Traditionally, Noether’s 
theorem for discrete systems is given considering only point transformations 
(see Sect. 1.5). In this way, while the connections are highlighted between the 
symmetries of space (or space-time) and the existence of constants of motion 
such as energy and angular momentum, other quantities, which we know (by 
other means) to be conserved but not corresponding to purely point transfor- 
mations, are neglected. On the other hand, in the last two decades we have 
seen an increasing tendency to present Noether’s theorem in a more general 
way involving also generalized velocities. In the following we choose this 
last approach. Then we have to handle transformations which work in the 
space (t,q,q). We have seen in the preceding section how to pass from the 
configuration space Q to R x Q. Since Q is, in the language of differen- 
tial geometry, a differentiable manifold, the space of g’s and g’s in the same 
language is the tangent bundle of Q and is denoted by TQ. Therefore we 
shall work in R x TQ and shall consider transformations depending on a real 
parameter e: 

t = ip{t,q,q,e), = ipk {t,q,q,e), k=l,2,... ,n, (1.B.35) 

^®E. Noether: Nachr. Akad. Wiss. Gottingen, Math. Phys. KL. II, 235-257 (1918). 

English translation in Transport Theory and Statis. Physics 1, 186-207 (1971). 
^^See W. Sarlet, F. Cantrijn: Generalizations of Noether’s theorem in classical me- 
chanics, Siam Review 23, 467-494 (1981). 
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with the understanding that the transformation of the q^s is not independent, 
but comes from (1.B.35). 

Usually, the transformations which one encounters in applications make 
a group; here we consider only one-parameter Lie groups; nevertheless the 
results we shall explain do not require explicit use of the concept of group. 
The functions (p and must be at least in each of the 2n + 2 arguments 
and the parameter e will be such that £ G / C M, where I includes the origin 
as interior point. Moreover, 

V>{t,q,q,Q) = t, 'ipk{t,q,q,0) = Qk. (1.B.36) 

Owing to the hypotheses, we can expand (1.B.35) in Taylor series in the 
neighbourhood of £ = 0, taking into account only first-order terms in 

t=^t + T{t,q,q)e + o{e), Qk = Qk + (t,q,q) £ + o{e). (l.B.37a) 

The principal linear parts, r and oit and with respect to £ in 5 = 0 
are called infinitesimal generators of the transformations (p and ; they are 

r {t, g, q) = ^ {t, g, g, 0); ^k (t, g, g) = ^ {t, q, q, 0). 

The transformations defined by (l.B.37a) are the infinitesimal transforma- 
tions associated with (1.B.35). From now on we shall therefore consider the 
transformations 



t = t + T{t,q,q)e Qk = Qk + ^k {t,q,q) £, (l.B.37b) 

from which we shall also obtain the variations of the g’s; the derivatives f 
and will be evaluated on arbitrary curves t Q{t), with t G [a,b] and 
[a, b] C M. Through (l.B.37b), a curve t q{t) is transformed to t (7); 
to first order in 5 , we have 



dqj 

dt 



dq^ dt 
dt dt 



Qi + 

1 ef 



’ Qi + ^ 



(ii - QiT-) • 



(1.B.38) 



We shall say that the infinitesimal transformation (l.B.37b) leaves the action 
integral invariant up to gauge terms if a function / = f{t, q, q) exists such 
that, for any differentiable curve t ^ q{t) , one has 



rt2 

Jti 



t,q{t),iq{t) 



dt 



jC{t,q{t),q{t))dt + e / -^dt + o{s), 

(1.B.39) 

with [^ 1 ,^ 2 ] C [a^b]. Transforming the first integral back to the interval 
[U,^ 2 ], (1.B.39) becomes equivalent to 

df (t, q, g) 



t,q{t), W 



^ £ {t,q{t),q{t))+e - 



dt 



■ 0 ( 5 ), (l.B.40a) 



which must hold for a whole family of curves t ^ q{t) and therefore must 
be an identity in t^q,q. Developing and retaining only first-order terms: 
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(1 +ef) = C {t, q{t),q{t)) + £ 



dt' 



and also 



dC ^dC ^ dC ^ . df 
Sqi ^ Cer — £ —. 
at OQi oqi dt 



From (1.B.38) one has Sqi — £ {^i — qif). Moreover, taking into account that 

dr dr . dr .. 

at dqi dqi 



• . dii .. 



we have 

^ ^ ^ 

dt^ ^ dqi " dqi 



d^i d^i . .. . f dr dr , 



dr 

di 



■Qj 



+£ 



9r dr , dr .. 

dt dqi dqi 



df ^df .df .. 
dt dqi dqi 



(l.B.40b) 



Now, (LB. 40b) is an identity in t,q,q and q, which means that the coeffi- 
cients of (ji have to vanish separately, yielding the n -|- 1 equations: 



dqi ^ dqj dqi^^ ) dqi' 

dt ^ dqi ^ dqi dt dqj 



f dr dr , 
\ 9t dqj 



+£ 



dr dr . 
dt dqi 



dt 




(1.B.41) 



(1.B.42) 



(1.B.41) and (1.B.42) are given by n + 1 partial differential equations, linear 
in the n + 1 unknown functions r and and represent the necessary and 
sufficient conditions for the action integral to be invariant up to gauge terms 
under the infinitesimal transformations with generators r and Equations 
(1.B.41) and (1.B.42) are called generalized Killing equations. Let us see 
now what the consequence is for the system of the invariance of the action 
integral under the transformations we have considered. Go back to (l.B.40b) 
and assume that, in it, the g^i(t)’s are solutions of the equations of motion, 
i.e. satisfy the Lagrange equations with the Lagrangian C. 



^^It is easy to show that, when C = \gkiq^q^ (a system of free mass points) and 
one considers a transformation t — t, q^ = q^ (LB. 41) and (1.B.42) 

are replaced by gik{d^^ /dq^) + gki{d^^ / dq'^) = 0, which are just the Killing equa- 
tions. See, for instance, L. P. Eisenhart: Continuous Groups of Transformations 
(Princeton University Press, 1933) p. 208. 
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By using the identities 

dC dC dC . dC .. 

dt ^ dt ^ dqi dqi 

dqi dt dt / dt \dqi) 

dC . dr dC .. d [ dC . \ d f dC\ . 

dqi dt dqi dt \dqi ) dt 

one obtains 

dt ^ dqi^ dqi ^ ^ dqi dt\dqi / dt dqi ^ 



dC.. d fdC \ d dC . dr 
d . _ X d dC .. .X d / . . J d dC .. . . df 

= + S%(«* “ 5%<«‘ 

That is, finally 



Therefore, if we have a system with Lagrangian £, the invariance up to gauge 
terms of the action integral under the transformation with generators r and 
^i has as a consequence that the quantity in square brackets in (1.B.43) is 
an integral of the motion. 

This conclusion constitutes the Noether’s theorem. Conversely, owing to 
what we have already demonstrated, we can say that a necessary and suf- 
ficient condition for a transformation to be Noetherian^ i.e., given a system 
with Lagrangian £, the quantity in the square brackets in (1.B.43) be a 
constant of motion, is that its generators satisfy the generalized Killing equa- 
tions (1.B.41) and (1.B.42). So far as these equations are concerned, it is clear 
that they can be used in two ways which are, so to speak, complementary. 
If we start with a given Lagrangian C and a transformation with generators 
T and (1.B.41) and (1.B.42) enable us to check if the transformation is 
Noetherian and then, from (1.B.43), to obtain the corresponding constant of 
motion. Conversely, if a transformation is given with known generators r and 

(1.B.41) and (1.B.42) can be used to single out the set of Lagrangians 
enjoying the property of invariance under the given transformation. In the 
first case, it may also happen that one has the Lagrangian C and wants to 
determine all the possible Noetherian transformations. This is the case in 
which we are interested. It being impossible, for obvious reasons, to deter- 
mine a complete integral of (1.B.41) and (1.B.42), in this case one will be 
obliged to make systematic attempts starting from the simplest and obvious 
cases. This will be done in the next section, where we consider a particularly 
meaningful system, the n- dimensional oscillator. 
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Before we pass on to the applications, it is interesting to devote our atten- 
tion to the possibility of extending Noether’s theorem also to non-conservative 
systems and at last to the inverse of the theorem itself. When a system is acted 
on by non-conservative forces of generalized components (z = 1, 2, . . . , n), 
the Lagrangian equations^^ become 



dt dqi dqi 



(l.B.44a) 



where C represents the Lagrangian corresponding to the conservative forces. 
Let us write (l.B.44a) in the form 



^ _ d dC dC ^ ^ 

L(i) = — — — ^ Qi = 0. 

dt dqi dqi 

and consider infinitesimal transformations 



(l.B.44b) 



t = t + er {t,q,q), 
^ ^ r ■ 

dt dt 



Qi + 

Qif + ipi {t,q,q) \ , 



(1.B.45) 



where, unlike in (1.B.39), the transformations of the generalized velocities are 
not determined by those of the coordinates and of the time, but depend on 
n functions (fi (which we assume to be C^) associated with the presence of 
the non-conservative forces. Let us now assume that transformations (1.B.45) 
leave the action invariant up to gauge terms, that is, as in (l.B.40a), 



«(<), I . ( 



dt 

dt 






df {t, q, q) 
dt 



+ o(e) 



holds. Expanding this in the usual manner and retaining only first order 
terms, we now have 



dqi * ^ dq 
which can be rewritten 



: ^ OC . . \ dC 



dt ’ 



(l.B.46a) 



d 

dt 



+ {U - QiT) 



dqi 



d£ It- ■ \ If • \ T df 

+ -^q>i - Qi {t,i - qiT) - (6 - qiT) Lf^i) = — . 



(l.B.46b) 



Following on from these premises, we can now state the extended Noether’s 
theorem: 



If the action integral is invariant up to gauge terms under the transfor- 
mations (1.B.45) and if, moreover, the identity 

^®See, for instance, H. Goldstein: Classical Mechanics, 2nd edn. (Addison- Wesley, 
1980) Chap. I. 
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^<P^ = Qi{i^-qiT) (1.B.47) 

holds, then the first integral 

/ = £r + - qiT) - / (1.B.48) 

dqi 

exists. 



The proof is quite obvious; in fact, it is sufficient to substitute (l.B.44b) 
and (1.B.47) in (l.B.46b). The relation providing the first integral, as can 
be seen, retains the form (1.B.43). By substituting (1.B.47) in (l.B.46a), we 
have 



- . dC dC . dC ( • . \ , . . 

Ct + — T + ~ + — [ii - qiTj + Qi{ii - qiT) = 



dt 



dqi 



dt 



and, by developing the total derivatives. 



C 

dqi \ dt 



dr dr . dr dC dC 

dt dqi dqi ^ dt^ ^ dqi 



^dii . dii .. 
dqj dqj 



qi 



dr 



dr 



dr 






^ . df df . df .. 

Since it must hold for any qAA^ if will be equivalent to the system of n + 1 
partial differential equations: 



dr dC f d^j dr . 

dqi dqj \ dqi dqi 



dqi 



{i = 1,2,... ,n). 



dc 

dt' 



dqi * dqi 



dji , di 

dt 



+ 



.( dr dr 



dqj ^^\dt dq. 



Qj 



+■^1 1 + 1 ;*) + = §f + 



(1.B.49) 



(1.B.50) 



Equations (1.B.49), (1.B.50) are the generalized Killing equations for the non- 
conservative systems. As can be seen, they differ from (1.B.41) and (1.B.42) 
only for the presence of the term Qi{^i - qir) in the (n + l)th equation. Also 
in this case, the Killing equations represent necessary and sufficient condition 
for the invariance of the action up to gauge terms and therefore, owing to 
Noether’s theorem, for the existence of a first integral. 

Returning to the case of conservative systems, let us see if Noether’s 
theorem has an inverse and, if so, under what conditions. Let us consider again 
the expression for the first integral (1.B.48). If we differentiate it with respect 
to qj and substitute (1.B.41) in the obtained expression, we get dl/dqj = 
Hij {^i - qir), where 




1.6 Noether’s Theorem 



49 



\Hij\ 



d'^C 

dqidqj 



is the Hessian determinant we have already assumed to be nonvanishing. If 
we call the elements of the inverse determinant, that is, 
finally have 

f) T 

= Jij ^ + qiT. (1.B.51) 

dqj 

In most cases concerning our applications, the Lagrangian contains the q^s 
through ^SijQiqj; therefore Jij = 6ij and (LB. 51) becomes 



^ a/ . 

dqi 



If, now, we substitute (1.B.51) in (1.B.48), we shall have for r 



At this point, we can state the inverse of Noether’s theorem: 

To any constant of motion I — g, t) for the dynamical system de- 
scribed by the Lagrangian C there correspond infinitesimal transforma- 
tions (1.B.51), (1.B.52) which leave the action integral invariant up to 
“gauge” terms, in the sense stated in (l.B.40b), for all the solutions 
t ^ q(t) of the equations of motion. 

As the function / = f{q, q, t), in (LB. 52) is quite arbitrary, it also follows 
that the function r = r(gf, q, t) is arbitrary, and thus (LB. 51) does not define 
a coordinate transformation but a family of coordinate transformations. This 
consequence, which a priori seems to introduce a troublesome arbitrariness, 
as a matter of fact enables us to “simplify” the transformation corresponding 
to a given first integral in the most convenient way. To this end, we state an 
easily applied proposition for real cases. 

Let us assume we have a first integral / = / (g, g, t) relative to the sys- 
tem described by the Lagrangian C and, moreover, this integral, through 
Noether’s theorem, corresponds to the transformation defined by 



'TSO, (1.B.53) 

with / = f{q,q,t)-, then, as a consequence, an infinity of transformations 
like 

^i=^i + Tqi, r = T{q,q,t), i = l,2,...,n (1.B.54) 

are defined, with / = / -h (r is a quite arbitrary function), corresponding 
to some integral /. We omit the proof, which consists of a simple check. It 
is also easy to verify that the inverse theorem and the above proposition can 
be extended to non- conservative systems. 
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1.7 An Application of Noether’s Theorem: 
The n-Dimensional Oscillator 



We have studied the n-dimensional oscillator (both isotropic and anisotropic) 
in the framework of Newtonian dynamics. This system is particularly impor- 
tant; in a certain sense it is the unique meaningful model of an n-dimensional 
physical system which is integrable for any n. In any case it is of great im- 
portance as a term of comparison. 

Before continuing with the study of the oscillator, we should note that, 
when in (1.B.48) one has / = 0, one speaks of absolute invariance and of 
symmetry of the Lagrangian, and when / 7^ 0, of invariance up to gauge terms 
and quasi- symmetry. For the oscillator we shall have both cases. Since in the 
forthcoming applications we shall have to handle Lagrangians not explicitly 
dependent on time, C — C{q,q), we shall limit ourselves to transformations 
with 

r^T{q,q), f = f{q,q)- (1.B.55) 

Consequently, the generalized Killing equations become 



* dqi ^ dqi 



dCi 



dr 






^dT . df . 
+ C—qi = —qi, 



Ot dC d^j . dr 

dqi^ dqj dqi dqi 



dqi 



i,j = 1,2, 



(1.B.56) 

(1.B.57) 



The n-dimensional Isotropic Oscillator 

Consider first an n-dimensional isotropic oscillator (of frequency u and unit 



mass). The Lagrangian will be 




^ 1 . . 1 2 


(1.B.58) 


and the equations of motion 




qi+Lj'^qi = 0 i = l,2,...,n. 


(1.B.59) 



As we have already emphasized, we cannot substitute (1.B.58) in (1.B.56) 
and (1.B.57) and try to solve the n -h 1 generalized Killing equations, to 
obtain all the possible solutions and r. Nevertheless, we can equally well 
use equations (1.B.57) and (1.B.56) for a systematic exploration of the first 
integrals corresponding to the “simplest” transformations. 

a) Then let us begin by considering a transformation which is a pure “time 
translation”, and of the simplest type, i.e. = 0, Vi, and r = const. One 
can immediately check that, in this case, (1.B.56) and (1.B.57) are identically 
satisfied with / = 0; r being an arbitrary constant, we shall choose it equal 
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to —1 for the sake of simplicity. Then t = t — Qi = From (1.B.48) the 
corresponding first integral is 

dC 

-C+ — {qi) = qiqi - C = E = const, (1.B.60) 

where E is the total energy of the oscillator. Therefore, the conservation of 
energy corresponds to the invariance of the Lagrangian under time trans- 
lations. This is a more formal way of saying that the time is an ignorable 
coordinate. 

b) If we consider an infinitesimal space translation along the direction 
given by a vector e (arbitrary unit vector), then 

6qi=eei, = ei. (1.B.61) 

It is easy to check that the transformation could be Noetherian only if 

QiCi = 0, (1.B.62) 

that is gf • e = 0. It not being possible, in general, to satisfy this condition, 
the transformation (1.B.61) cannot be Noetherian and so the Lagrangian of 
the oscillator is not invariant under space translations. 

c) Another simple case may be that of a space transformation depending 
on the g’s: the simplest one is when the dependence is linear, i.e.: 

ii=uJijqj, r = 0, (1.B.63) 

with a matrix u? having constant elements. Let us see, by substituting (1.B.63) 
in (1.B.56) and (1.B.57), what are the conditions, if any, for the matrix oj. 
From (1.B.56), with. / = 0, one obtains 

^ijiiiqj - qiqj) = 0. (i.B.64) 

Therefore the equation is identically satisfied under the condition that 
cjij = — cjji, i.e. uj = Then the transformation is such that q^ = 

qi + euJijqj^ t = t, so that ^ij = ~^ji and the corresponding 

first integral is 

(jJij qiqj = const. (1.B.65) 

In an n- dimensional space, Sqi = e ujij qj with uJij = —ooji represents an 
infinitesimal rotation: since there are n (n — l)/2 linearly independent anti- 
symmetrical n X n matrices, (1.B.65) represents a set of n (n — l)/2 linearly 
independent first integrals. In the case n = 3, (1.B.65) yields r x f = const, 
that is, the conservation of angular momentum. 

d) Finally let us consider the possibility of a spatial transformation de- 
pending on velocities: 

T = 0, ^i = aijqj, (1.B.66) 

where a is a matrix with constant elements. By substituting (I.B.66) in 
(1.B.56) and (1.B.57), we obtain 
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-Lu‘^ aijQiqj ^ i = l,2, (1.B.67) 

We have (1.B.67) identically satisfied if we assume 

/ {q, = \ Qj - ^ Qi Qj, (l.B.68a) 

with Qij = aji^ i.e. a — . Then the transformation which turns out to be 

Noetherian is 

t = t, q.=qi-\- = aijqj, aij = aji. (1.B.69) 

The corresponding integral is aij qiqj — /; that is 

^ dij {Mj + ^^QiQj) = const. (LB. 70) 

Before entering into a discussion about the physical meaning of (l.B. 70) let us 
try to understand what type of transformation (l.B. 69) represent. Let us refer 
to T(5, i.e. to the space of the q's and g’s. Remembering that from (l.B. 38) 
one has dqi = e{^i — qif), we have in TQ that 5qi = eaij qj, Sqi = eaij qj. 
From the equations of motion (l.B. 59), Sqi = —euP' aij qj. If, as in Sect. 1.3, 
we perform a change of scale defined by 

qi=uJVi, (l.B. 71) 

we obtain 

Sqi = euaijVj, Svi = —euoaijqj. (l.B. 72) 

Therefore the Noetherian transformation (l.B. 69) can be interpreted as a 
rotation of the two subspaces of the qi^ and Vi^ as a whole around an axis 
orthogonal to TQ. The first integral (l.B. 70), therefore, corresponds to a 
rotational symmetry of the space of the g’s and g’s: it is a symmetry which 
was not displayed in the traditional versions of Noether’s theorem and was 
considered a “hidden symmetry” . Let us consider now (l.B. 70) from the point 
of view of its physical meaning. First of all, it must be pointed out that, since 
n{n + l)/2 is the number of components of a second-order symmetric tensor 
and since also the a^j’s are arbitrary constants, (l.B. 70) states the existence 
of the n{n -h l)/2 first integrals: 

1 {qiQj + J^qiQj) = Aij = const; = Aji- (l.B. 73) 

For a better understanding of the implications of (l.B. 73), it is convenient to 
consider the already familiar case n = 2 (planar oscillator). In this case, Aij 
will have three independent components: 

Ai 2 — A21 = - (91^2 + • 
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We recognize immediately that An = Ei and A22 — £’2, where Ei and E2 
are the energies of the two oscillations; moreover, the trace of the tensor A{j 
is A = All T ^22 = E^ the total energy, and the determinant 

|A| = A11A22 — A12A21 = (^1^2 — Q2Qi)^ = ^ 

where L is the angular momentum. As to A12 = A21, if we start from the 
solutions (which we rewrite in a slightly different way from those in Sect. 1.3) 

qi = a cos (jj{t — to), Q2 = hcos [uj (t — to) + S], (1.B.74) 

where S is the initial phase difference and to the initial time, we have 

^12 = ^ (91^2 + w^gig'2) = ^co'^abcosS. (1.B.75) 

Therefore A12 is connected to the initial phase difference; we have already 
learned in Sect. 1.3 that it is just this phase difference which determines the 
orientation of the orbit: this is the meaning of A12. We also learned that for 
5 = 7 t /2 we have an ellipse with the axes coinciding with the directions of the 
qi and q2 axes: in this case A12 = 0 and the tensor Aij is reduced to diagonal 
form with qi and q2 as principal axes. 

At this point, let us see what relations exist among the integrals we have 
found, since they surely exist in excess. In fact, from (1.B.74) it is evident 
that we may have at most three independent first integrals (corresponding 
to the constants a, b and J), apart from to (a constant always present in 
the case of autonomous systems and connected with the arbitrariness of the 
origin of times); we have met five integrals: total energy, angular momentum 
and the three independent components of Aij. Thence two relations among 
them must exist: one is obviously given by An + A22 == E and the other, 
which we have already seen when we evaluated the determinant of A, can be 
rewritten as 

n - 1 = (^n)' + {A 22 f + 2 (Ai 2 f • (1.B.76) 

The Anisotropic Oscillator 

We already know from Sect. 1.3 that, when the frequencies are different and 
not commensurable, the system consists of n uncoupled oscillations without 
any mutual relation: therefore no first integral related to the system as a 
whole exists, apart from the total energy owing to the additivity property 
of the energy itself. Therefore we have to study only the case of commen- 
surable frequencies. In this case an element exists which is common to all 
oscillations: the period; in fact, as we have seen, when the frequencies are 
commensurable, a period exists (a multiple of every single period through a 
different coefficient) which is common to the whole system. In other words. 
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the anisotropic oscillator with commensurable frequencies is a periodic, and 
not a multiperiodic, system. We expect that this fact entails the existence of 
first integrals (one or more) concerning the system as a whole. 

The Lagrangian will be 

^ = (I-B.77) 

i=l 

If we investigate if there are the same integrals of the isotropic case, we realize 
immediately that the existence of different frequencies makes transformation 
(1.B.63) no longer Noetherian ((1.B.56) cannot be satisfied) and then the 
angular momentum is not conserved. Moreover, (1.B.66) in its general form 
is also no longer Noetherian. Only the diagonal elements remain and the 
transformation which still holds is 

= an Qi (no sum over i), (1.B.78) 

and ^ 

fiQ, 9) = ^ ^ an {<ii - (l.B. 68 b) 

i=l 

Consequently, we have the integrals An (namely the energies of the individual 
modes of oscillation) but not the non-diagonal ones Aij . On the other hand it 
is easy enough to realize that the isotropy of the space TQ which led us to the 
interpretation provided by (1.B.72) is now removed owing to the existence 
of different frequencies, which prevents us from arriving at (1.B.72) through 
(1.B.71) (the different frequencies are “mixed” performing the transforma- 
tion). Therefore we shall now have to investigate the nature of transforma- 
tions, different from (1.B.66), corresponding to the integrals which certainly 
exist (we know, in fact, that closed trajectories correspond to commensu- 
rable frequencies). For the sake of simplicity, let us limit ourselves to the 
two-dimensional case (planar anisotropic oscillator). Then 

1 ( 4 ? + qI) - ^ (wf 9i + grf) . (1.B.79) 

With oui and LO2 being commensurable we can, without loss of generality, put 
= nuji — nuj^ with n > 0. Let us start from the parametric equations of 
the trajectory: 

qi = acosuo {t - to), Q2 = bcos[nuj (t - to) + 5 ], ( 1 .B. 80 ) 

As El — E2 = ^ are the two energies, we have only to look 

for a third integral related to the phase difference 5. Differentiating (1.B.80) 
one gets 

= -uj a sin Lu (t - to), ^2 = -nu;bsm[nu; (t - to) + S]. (1.B.81) 

From (1.B.80), one has 

X ^2 Qi 

0 = arccos n arccos — , 

b 



a 



(1.B.82) 
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and, using (1.B.81), 



cos ^ arccos 



cos 6 



Q2 



Q 2 . f qi\ 

sm n arccos — . 

ri(jjb \ a / 


(1.B.83) 


be rewritten as 




-U 

) n^co^ab ^\a)' 


(1.B.84) 



where T^(x) = cos(n arccos x) is the Chebyshev polynomial of the first kind 
and degree n and T^(x) its derivative with respect to the argument x. It is 
easy now to check, even if calculations are a little cumbersome, that to every 
integer n a first integral corresponds: 



I„ = (2Ei)"/2(2£;2)1/2cos(5. (1.B.85) 



For n = 1 , one obtains 2 Ai 2 , where A 12 is the integral (1.B.75). Then (1.B.85) 
and (1.B.84) enable us, for any integer n, to express the corresponding first 
integral. From a formal point of view, we can also consider In as a scalar 
product A- B between two vectors of moduli \ Af = (2Ei)'^ and \Bf = 2 E 2 . 
That is, 

T A.2 = (2£^i)^, Bi = B2=riLuq2. 

For n = 2, we have 

I 2 = {2Ei)\/2E2 cos 5 = (2ujqi qi) q2 + - q\)2uq2, (1.B.86) 



and, for n = 3, 

/s = {2E\)^^‘^ y/ 2 E 2 cos 5 = (^q\ ~ ^ 1)92 T ~ 3c<;92- 

(1.B.87) 

It remains, now, to consider the case of n given by a non-integer rational 
number. One can check that (1.B.85) also holds in this case, but it is no longer 
possible to express cos (5 in a compact form as in (1.B.84). In the simplest 
case, n = 3/2, by means of the formulae of elementary trigonometry one 
obtains 

^ 3/2 — (2F'i)^/^(2F^2)^^^ cos 6 = 6 / 2 A 1 + — 92 A 2 , (1.B.88) 

with 

Ai,2 = T T ± 3u;gi qf ) . 

Obviously, Af -h A| = = ( 2 F^i)^/^. 

We recall that, forn = 2 and n = 3/2, the trajectories for various values of 
S have been given in Sect. 1.3. To sum up, we can say that, for the anisotropic 
case also the third integral exists and is given by a one-valued function of 
q and q (the integral contains cos 5 and not simply 5). We have now to go 
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back again to Noether’s theorem and to realize what are the transformations 
corresponding to the integrals we have found. From examples (1.B.86) and 
(1.B.87) it is evident that now the integral proportional to cos J is no longer, 
like Ai 2 in the isotropic case, a second-degree polynomial in the variables 
Qi^ Q 2 ^ Q 2 but on the contrary a polynomial whose degree is increasing with 
n{n = LU 2 /UJ 1 )’’, the situation becomes even more complicated when n is not an 
integer because of the appearance of roots, as one can see from the example 
(1.B.88). Therefore, we shall no longer have any single transformation like 
(1.B.66), but different transformations corresponding to different values of n 
(and of increasing complexity with increasing n). In the simplest case (n = 2), 
we are still able to proceed tentatively in search of the correct transformation. 
The structure exhibited by integral (1.B.86) suggests trying a transformation 
like 

= AijkqjQk, r = 0, i, j. A: = 1, 2, (1.B.89) 

where the components of Aijk are some constants. In this case (1.B.56) and 
(1.B.57) become 



^"dqi dqidq^^ 



dq^ dqi 



dqi 



(1.B.90) 

(1.B.91) 



Substituting (1.B.89) in (1.B.90) and taking into account that now £ = 
+ ^ 2 ) ~ + 4 ^ 2)5 obtains (for the sake of brevity we omit the 

calculations) for / the expression 



f = A {qiqici2 - qiql - ‘^‘^q\q2) 

and for the components of Aijk the values 

A121 = ^211 = A^ A112 = — 2A, 

^111 = ^221 = ^122 = ^222 = ^212 = 0 , 



(1.B.92) 



(1.B.93) 



where A is an arbitrary constant which can be put equal to 1. The wanted 
integral will then be 

d JC/ 

(1.5.94) 

- - q2q\ - <^'^q\(i2) = -q\q2 + qiM2 + io‘^qlq2- 

We can check immediately, by comparing with (1.B.86), that I = I 2 I 2 U and 
that the integral in (1.B.94) corresponds to the transformation generated by 
= —2^2^1+9192, ^2 = qiqi, T = 0. From (1.B.89) and (1.B.90) it is evident 
that the transformations we have to handle now are no longer identifiable as 
rotations, translations with constant coefficients, etc., but are more complex. 
We can rewrite the generators in the form 
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6 = -9i?2 + - 929i) , 6 = 9191, (1.B.95) 

and interpret and ^2 as generators of a (differential) rotation in the qiQ 2 
plane (with coefficients given by Qi) and of a translation along qi of an amount 
given by the angular momentum qiq 2 — q 2 qi (which in this case is not con- 
stant). Anyhow they are no longer “simple” transformations involving the 
whole space of the ^’s and ^’s. Nevertheless, the method we have adopted 
has enabled us all the same to obtain integral (1.B.94), which is a polynomial 
(in the g’s and ^’s) of a higher degree than A 12 . 

The success achieved so far, however, does not mean that the method 
works in every case. When the Lagrangian of the system has a relatively sim- 
ple expression in rectangular coordinates (as in the above case) the method 
can always be successfully applied. In fact, with rectangular coordinates, the 
“simplest” transformations are with constant coefficients and the procedure 
is unambiguous. On the other hand, when the Lagrangian is given in curvi- 
linear coordinates, the transformations (even the simplest ones) do not in 
general have constant coefficients, and then the choice is to a great extent 
arbitrary. 



1.8 The Principle of Least Action in Jacobi Form 



For a conservative system, Hamilton’s principle assumes a particularly mean- 
ingful form due to Jacobi. 

In the sixth of his celebrated lectures on dynamics, Jacobi proposed 
eliminating from Hamilton’s principle any feature which could make one think 
of a metaphysical cause^ not considering himself satisfied with the formulation 
given by Lagrange and Hamilton, which, in his opinion, still keeping the 
presence of the time variable, could allow a finalistic interpretation. In the 
Jacobi formulation, as we shall see, the principle is, so to speak, geometrized. 

Let us then consider a conservative natural system (see Sect. 1.5). The 
action integral will be 



I = 








As the system is conservative. 



(l.B.96a) 



H = T~L{q,p) = E = const, (1.B.97) 



and therefore (l.B.96a) will become 



G. J. Jacobi: Vorlesungen Uher Dynamik^ 2nd rev. edn. (Reiner, Berlin, 1884), 
pp. 43-51. (Reprinted by Chelsea, New York, 1969). 
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Pkdqk - {t2 - ti)E. (l.B.96b) 



1= f{T.P>^dq,-ndt] = f^Y: 

dtl \k=l J k=l 



If we now consider any two neighbouring curves Fi and l 2 with the same 
end points and with tangents satisfying (1.B.97), we have 



/(A) = y Y2Pi^d.qk + {ti-t2)E, 

p Th 

^(A) = J YlP’^dqk + ih - 12) E, 



and then 



<57 = I{E2) - /(A) = 






Pkdqk- 



On the other hand, owing to the Hamilton principle, we have 61 = 0 and 
thus 






Pkdqk = 0. 



(1.B.98) 



The obtained result states that, in the Q space, the trajectories of the sys- 
tem satisfy the variational equation (1.B.98) with supplementary condition 
(1.B.97). It is understood that the limits of integration are fixed in Q. 

Let us now define the reduced action S'* (Maupertuis action): 

s* = j Pkdqk, 

where pk and qk satisfy (1.B.97). Recalling the definition pk = dCjdqk, we 

Recall also that for a natural system (see Sect. 1.5), 

/ — 2^ikqiqk- 

Moreover, V — V{q) is the potential energy and C — T—V (q) the Lagrangian. 
Accordingly, we have 

dC . 



Therefore the reduced action becomes 



S* = 2 Tdt, 



(1.B.99) 
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and the variational principle assumes the form 

SS* =S J 2Tdt = 0, (l.B.lOO) 

with H — E = 0 and, consequently, T = E — V. 

Now, following Jacobi, it remains to eliminate the time. Let us refer again 
to the (kinematical) line element in the configuration space introduced in 
Sect. 1.4. From ds^ = 2Tdt^, we have 



dt 



ds 

vw 



(l.B.lOl) 



and therefore 

2Tdt = VWds = y/2{E - V) ds. 

Neglecting the factor \/2, we can rewrite principle (l.B.lOO) in the form 



(5 




-Vds = 0. 



(1.B.102) 



It must be remembered that ds in (1.B.102) is, as is obvious, the square 
root of a quadratic differential form and not the exact differential of some 
function. 

Finally, if we define the action line element dsi by 

ds\ = {E - V) ds^ , (1.B.103) 



the variational principle will be completely geometrized and will take the 
form 

S J dsi= 0. (LB. 104) 

This means that the solution of the problem of the motion of a conservative 
natural system leads to the search for the geodesic of a Riemannian mani- 
fold whose metric is given by (1.B.103). The time no longer appears in this 
formulation; (1.B.104) determines the path of the representative point in the 
configuration space, not the motion in time. 

For this part of the problem, one will have to integrate (l.B.lOl). More- 
over, it must be said that the equilibrium states of the system are excluded 
from this formulation; in fact, in this case E = V, and then the metric 
(1.B.103) becomes singular. When the system consists of only one particle 
moving under a potential U = -V, the line element ds is simply the line 
element of the ordinary three-dimensional space in arbitrary curvilinear co- 
ordinates. With regard to (1.B.103), we can further remark that the metric 
dsi is obtained from ds by multiplying by a factor depending only on the q's 

^^The phrases “kinematic line element” and “action line element” are due to 
J. L. Synge, see Handbuch der Physik, III, 1 (Springer, 1960) and the paper al- 
ready quoted. 
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and not ^’s, i.e. only on the “position” and not on the “direction” . Therefore 
we are in the presence of a conformal transformation and thence in passing 
from one metric to another, the angles are conserved. As to the length of 
any arc of curve, it decreases more and more for kinetic energies tending to 
zero, untill vanishing for T = 0; in fact, we have already remarked that, for 
E = V^ the metric dsi becomes singular. 

Lastly, a comment regarding the term “least action” : we have preserved 
it in homage to tradition, but more correctly one must say stationary action, 
since it is required that the action integral be stationary and not necessarily 
minimum. The same, from the geometrical point of view, is true for the 
geodesics. As Jacobi^^ had already pointed out, if we consider, for instance, 
the geodesics on the sphere, the geodesic consisting of an arc of a great circle 
joining two points A and C corresponding to an angular distance greater 
than 7T is not the shortest distance. Therefore we can speak of a minimum 
only when we are comparing two neighbouring curves. We take from Jacobi 
Fig. 1.15. Let B be the diametrically opposed point to a given point A. The 
great circles joining A and B obviously have all the same length; then, if 
AaBC lies on a great circle, we have 

AaBC = Af3B + BC = A0 ^ pB + BC. 

If /3 is a point close to B, let us draw the arc of a great circle joining (3 
with C. Then: 

(3C <!3B + BC. 

Thus AfS + (3C is smaller than the arc of great circle AaBC. 



A 




^^See Jacobi, op. cit., p. 46. 
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As we have seen, in the Lagrangian formulation generalized coordinates 
gi, Q 2 , ... , have been introduced, for a system having n degrees of free- 
dom, and their time derivatives enter explicitly in the Lagrangian besides the 
^’s themselves. Nevertheless, the equations of motion are n differential equa- 
tions of the second order in the unknown functions qi{t). The further step of 
passing to 2n first-order differential equations was taken in the Hamiltonian 
formulation of the dynamics. We have already met in (1.B.14) the definition 
of the momentum conjugate to the coordinate 

dC 

Pi = ^ (* == 1,2, ... ,n), (l.C.l) 

and in (1.B.28), the definition of the Hamiltonian, 

'H = PiQi - C. (l.C.2a) 



In (1.B.28) T~L appeared as a constant of motion when in the Lagrangian the 
time was not explicitly contained; therefore the Hamiltonian also did not 
contain the time explicitly. Now, on the other hand, we consider (l.C.2a) 
a general definition with H and C also containing, for all eventualities, the 
time in an explicit way. Relation (l.C.2a) which contains H and C in a, 
so to speak, symmetrical role, represents a Legendre transformation. If, for 
the sake of simplicity, we limit ourselves to only two variables, a Legendre 
transformation transforms a function f{x,y) into another one g{x,z) with 
2 : = Ofidy and such that dgjdz — y. In our case f{x,y) = C{q^q) and 
g{x,z) = n{q,p). 

Now, differentiating (l.C.2a), we have 



dn = qkdpk +Pk dqk 



dC ^ dC dC ^ 
^ dqk - -^dqk - — dt. 
dqk dqk dt 



Owing to (l.C.l), the coefficients of dqk mutually cancel and then 



m _ m 

dqk dq^ ’ dp^ 



Exploiting again Lagrange’s equations and (l.C.l), one obtains dH/dq^^ — 
—pkj and then the system 

dLL dl~L 

= A = «,= 1,2, (1.C.3) 

Equations (1.C.3) are called Hamilton's equations^ or canonical equations. 
Summarizing: the function H is, in general, a function of the qk^ Pk and the 
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time; system (1.C.3) consists of 2n first-order equations; the conjugate vari- 
ables Pk do not necessarily correspond to coordinates and components of 
momenta, but may be completely general. Every time it is possible to char- 
acterize the state of a system by means of n generalized coordinates Qk and 
n conjugate momenta, given by (l.C.l), obeying (1.C.3), one says that the 
system is a Hamiltonian dynamical system: the particular structure of (1.C.3) 
distinguishes it among the systems of 2n first-order differential equations. 

Hamilton’s equations can also be obtained directly from the variational 
principle, as we did for Lagrange’s ones. In fact, owing to the duality of the 
Legendre transformations, we can start from the Hamiltonian to obtain the 
Lagrangian 

C^p,q,-n (l.C.2b) 

and write down the action integral in the form 



{PiQi - Wjdt. 



(1.C.4) 



Since the second term in (l.C.2b) does not contain the derivatives of the p^’s, 
these ones do not even appear in the integrand of the variational principle 
and therefore the variation of I has the same conditions we have already fixed 
for the The variational problem, now, has 2n variables and the resulting 
equations are 

dt dqi dqi ~ dt dqi ’ ^ r^ 

(l.L.oj 

d dc dc . on ^ 

-77 — = 0 — qi-h — — — 0 , 

dt dpi dpi dpi 

which are just the canonical equations (1.C.3) obtained directly from the 
variational principle. 

The Hamiltonian system (1.C.3), owing to its peculiar structure, can be 
written in a form, called symplectic, which employs the matrix formalism; this 
form will be useful in applications. For a system with n degrees of freedom, 
one defines a column vector z, with 2n components, such that 

~ Pi 2, • • • , Tl) (1.C.6) 

and the columm vector dH/dz, also with 2n components, such that 



Lastly, one defines the 2n x 2n square matrix 

■'=(-1 i)- 



(1.C.7) 



(1.C.8) 
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where 0 is the n x n matrix with vanishing elements and 1 is the n x n identity 
matrix. Then it is easy to verify that system (1.C.3) can be compactified in 
the form 



( 

I 

II 

133I 


(1.C.9) 


Moreover the matrix J has the following properties: 




= -1, 


(I.C.IO) 


II 

M 

II 

1 

II 

1 


(l.C.ll) 


|J| = l. 


(1.C.12) 


Returning to the usual formalism, if we evaluate the 
d'Hjdt^ we obtain 

dU dU . dU . dn 

dt dq^ dpj^ dt 


total derivative 


and, because of (1.C.3), 

dn _ dn 

dt dt 


(1.C.13) 


When n — n{q^p), that is, the Hamiltonian does not depend explicitly on 
time, (1.C.13) gives dn/dt = 0 and then 


q2, ■ ■ ■ , qn, Pi, P2, ■■■ , Pn) = const. 


(1.C.14) 



Obviously, (1.C.14) is a first integral of system (1.C.3) and the result is not 
unexpected, since we have already obtained that 1~L = const, when C does 
not contain the time: in this case, the mere inspection of (l.C.2a) provides 
(1.C.14). Nevertheless, it is important, for the forthcoming applications, to 
clarify the implications of the first integral (1.C.14). To do this, it is necessary 
to deduce what type of function (of the ^’s and p’s) the Hamiltonian may be. 

As regards the kinetic energy T, we have already said in Sect. 1.5 that 
for natural systems it is a quadratic form homogeneous in the ^’s, indicated 
by 72 , whereas for non-natural systems it may have also terms linear in the 
q’s or terms not containing the q’s at all. It happens that a system is, for 
instance, a natural one when considered in an inertial reference frame whereas 
it becomes non-natural if we pass to a rotating frame. Therefore, in general, 
T = To + 7i -h ? 2 , where Tn means a homogeneous function of degree n in 
the g’s (which may also depend on the g’s). Owing to the Euler theorem, 

nTn = 



and then 



^PkQk 

k 



dH . d'T . d(To 

i: = E — 

fc k k 



-T 1 +T 2 ) 



d(jk 



Qk — T\ + 272- 
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Prom (l.C.2a) and (1.B.7), since V depends only on the g’s, 

H = Ti -h 2?2 — (To + 7 i+72) + P = —To + T 2 + P- 

For natural systems, To = Ti = 0 and then T = T 2 - In an inertial reference 
frame, the system is natural, and therefore 

(1.C.15) 

that is, the Hamiltonian coincides with the total energy and (1.C.14) repre- 
sents the energy integral. 

In a noninert ial reference frame, with To and Ti different from zero, if H 
does not depend explicitly on time, the first integral (1.C.14) still exists, but 
it does not coincide any more with the energy integral. Let us look, as an 
example, at the case of a coordinate system i? (^, ry, rotating with angular 
velocity lo with respect to an inertial system O (x, z), the origins i? and O 

and the axes ( and z coinciding. The two coordinate systems are connected 
by the relations 

X = ^ cos ujt — T] sin (jot^ 
y = ^ sin cot T] cos u)t^ 

^ = c, 

and therefore the kinetic energy in i7(^,ry,C) will be^^ 

= + 2 w(^?7 - T ] i ) + + 77^ + . 

The momenta conjugate to 77 , C will be 

= m{i-ujrj), 

P-n ='m{r] + uj^), 

= mC 

SO that 

For H = PiQi - T+V, 

^ = E + CPc]-r+V = (r?P^ - +T + V. 

If V has a rotational symmetry around the z axis, Ti will not explicitly contain 
the time, and therefore H — const. This is called the Jacobi integral 
One can see immediately that it is not the energy integral; in fact, 

E = T+V = + p; + Pl) + V. 

^^From now on, to simplify the exposition, we omit the indices labelling the indi- 
vidual mass points. 
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But, in this case, the energy is also conserved, and then E — Ti — const, as 
well. This provides the conservation of angular momentum; in fact, 

E - n = - T]P^) + t ?^) 

and, transforming this to the inertial system, we get 
E — % = u m {xy — yx). 



1.10 The Integral Invariants — Liouville’s Theorem 

As we have seen, the Hamiltonian dynamics introduces, besides the n gener- 
alized coordinates the n conjugate momenta pi as independent variables. 
Going on to use geometrical language, we therefore have a 2n-dimensional 
space {phase space). From the point of view of differential geometry, every 
vector p related to a point q of the manifold Q {configuration space) is an 
element of T*(5^ {cotangent space in q), this one being the dual space of TQq 
{tangent space in q), which consists of all real linear functions on TQq. In 
fact, as we know, p — dC/dq = p{q^ q). The union of all cotangent spaces is 
called cotangent fibre bundle and indicated by T*Q. Then, the 2n-dimensional 
phase space with coordinates {qi,Pi) is, in geometrical language, the cotan- 
gent fibre bundle of the manifold Q (configuration space). By analogy with 
what we have done in Lagrangian dynamics, where we introduced the space 
of events R x Q, here we shall introduce the space R x T*Q, called by Cart an 
the state space.^"^ In this 2n -h 1-dimensional space, the problem of motion is 
completely geometrized and the complete solution of the Hamilton equations 
is represented by a family of infinite curves which fill without intersecting 
the space itself. We shall now deal with a problem introduced by Poincare^^ 
and later generalized by Cartan:^^ the problem of the existence of integral in- 
variants. These are quantities, represented by means of integrals, which stay 
constant during the motion and assume a particular significance if one looks 
at the motion of the phase points in the state space. 

To introduce the subject, we shall first consider a completely general 
system of equations, later moving on to the case that is of interest to us: that 
of a Hamiltonian system. 



Cartan: Legons sur les invariants integraux (Hermann, Paris, 1922), p. 4. 
Poincare: Sur le probleme des trois corps et les equations de la dynamique, 
Acta Mathematica 13 , 1-271 (1890). 

Cartan: op. cit. 
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The Integral Invariants 

Let us consider, in all its generality, an autonomous system (the conclusions 
will be the same in the case of a non-autonomous system - our choice is made 
only with the aim of simplifying the presentation) of the form (1.A.32), where 
the functions Xi{x) are in a given m-dimensional domain. The solutions 
will be functions 

Xi^^i{t\x\,xl,...,xl^). (1.C.16) 

In (1.C.16), the constants X 2 , • • • , indicate the values of a:i, ^2, . . . , Xjn 
corresponding to a fixed (initial) value of t, which can even be assumed 
equal to zero. System (1.A.32) can be considered to be the equations de- 
scribing the motion of a point (with coordinates Xi, X2, ... ,Xm) in an Tri- 
dimensional space. If we consider all the points occupying an r-dimensional 
region (r <m) i?o at t = 0, the same points at a time t will occupy a region 
Qf, still r-dimensional. If the r-dimensional integral of a given function F on 
the region i?o retains the same value for any t, i.e. 



/ FdQ= / FdQ, VL (1.C.17) 

we say that the integral considered is an integral invariant of the system 
(1.A.32) and r is the order of the invariant. Let us begin from the simplest 
case: r = 1. As we have seen in Sect. 1.2, (1.C.16) can also be interpreted as 
the definition of a transformation (depending on the parameter t) from 
to x\ 

(1.C.18) 

The operator transforms the point at t = 0, into the point x, at the 
time t. Let us assume that the Jacobian of the transformation. 



J = 



d{(pi, LP2, , (fm) 

dix'l, , x°^) 



(1.C.19) 



does not vanish for the values of and t in which we are interested. Let us 
now consider, not a single initial point Xq, but the points of a whole curve, 
7 o. The transformation defined by will transform every point of 70 into 
a corresponding point of a new curve: 7^. In this way, we have a tube of 
trajectories which has the curve 70 as generatrix; every point of 70, as a 
consequence of the transformation will move like a particle of a fiuid: at 
the time t the particles that were on 70 at the time t — 0 will occupy the 
points of the curve (see Fig. 1.16). 

Let us now consider a vector field v = v{x) whose components Vi{x) are 
functions of class defined in the domain in which (1.A.32) are defined. If 
the relation 
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holds true for any curve 70 in the domain considered and for all values of 
then we say that the integral 

h = [ v-dx (l.C.21a) 

is a linear integral invariant, or a first-order invariant, for the system (1.A.32). 
When (1.C.20) is valid for any curve, open or closed, the invariant is said to 
be absolute; when (1.C.20) holds only for closed curves, the invariant is said 
to be relative. We can parametrize the curve 7^: let a be the parameter and 
0 and 1 the values of a corresponding to the end points of the curve. In 
this way, to every point of 7^ there corresponds a value of a and vice versa. 
Moreover, on the surface of the tube of trajectories, if we vary t (for fixed a), 
we obtain the trajectory of a particle of the fiuid; if we vary a (for fixed t) 
we obtain the simultaneous positions of all particles. Therefore the integral 
Ii can be written 



h 



/ 



V • dx 




L 



1 ■ 

Vi^—da. 

oa 



(1.C.22) 



Starting from (1.C.22) it is possible to show^^ that the necessary and sufficient 
condition for the validity of (1.C.20) is given by 










0. 



(1.C.23) 



Obviously, when the curve is closed, the first term vanishes and the resulting 
relation can be expressed by considering that the differential form 



f dvi 



\dxj 



dvj 

dxi 



Xj dxi 



must be the exact differential of a given function of Xi, ^2, ... , Xm- Before 
we consider the application of all this to Hamiltonian systems, it is worth 
looking at the other extreme case of integral invariants: when the order r of 
the invariant coincides with the dimensions of the space, i.e. with the order 
m of the system. Given a function M = M(xi, X2, . . . , x^), of class 
and defined in the same domain of the Xi (xi, X2, ... , Xm), lot us define the 
integral 

/^ = / Mdxi dx2 . . . dxm (1.C.24) 

JEt 

over an m-dimensional region Et with finite volume, which is the region 
occupied by a given whole of points at the time 1 . Eq is the region occupied 
by the same whole of points at t = 0, in order to have Im invariant it must 
hold that 



^^See, for instance, L. A. Pars: A Treatise on Analytical Dynamics (Heinemann, 
London, 1964) Sect. 21.6. 
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/. 



M J dx^ dx2 . . . dx 



= / 

JEt 



M dxi dx2 . . . dxm^ Vt, (1.C.25) 



which can also be rewritten as 

dim ^ f d 



dt 



— Ifn. — 



[ ^ (M J) dxi dx2 . . . dx^ = 0. 
Jeo dt 



Therefore, the necessary and sufficient condition for Im being an integral 
invariant of order m for the system (1.A.32) is given by 



-(JUfl=0. 



By applying the Liouville lemma, which states that 

j 



dXi dX2 dXm\ 

,dxi dX2 dXm) ' 

we can rewrite condition (1.C.26) in the form 

^ / » r tr \ dX[ \ — ' OXi 



i=l 



i=l 



(1.C.26) 



(1.C.27) 



(1.C.28) 



(where J 0 according to our hypotheses). The functions M satisfying 
(1.C.28) were called by Jacobi multipliers for the system (1.A.32). 



The Liouville Theorem 

Let us see, now, how the theory of integral invariants can be applied to 
Hamiltonian systems, that is, when the system (1.A.32) coincides with the 
system (1.C.3) of Sect. 1.9. In this case, according to (l.C.6-9), 

dn _dn _ dn _ dn 

dxi+n dpi ’ dxi dqi ’ 

where now i = l,2,...n = m/2. Immediately we have 

^ ^ ^ A / d^H _ d^n \ ^ 

^^\dpidqi dqidpi) 

and therefore, from (1.C.28), M — const, and in particular M = 1 is a 
multiplier that gives Im = const. The choice M = 1 makes Im = hn be 
the volume of a given region of phase space, and this region (defined as that 
containing a given number of points) keeps its volume invariant (yet changing, 
obviously, its geometrical shape). Going back again to the analogy with a 
fluid, we see that the phase space behaves like an incompressible fluid. Of 
course, the above conclusion may also hold true for non-Hamiltonian systems: 

^®See Pars, op. cit.. Sect. 21.7. 
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Every time 



f ^=0 

hi 



that is the vector field X has vanishing divergence, the phase volume is 
conserved. (Liouville’s Theorem). 



If we reconsider the linear invariants (still dealing with Hamiltonian systems 
as above) and the condition (1.C.23) in the case of a closed curve, we see for 



— Pi) — O 5 



(1.C.29) 



that the integral (l.C. 21 a) becomes 



/i = 





(l.C. 21 b) 



where the sum is written explicitly to emphasize that i goes from 1 to n only. 
The integrand of (1.C.23), by substitution of (1.C.29), becomes 





dvj 



Xjdx 



n+2 



= E (-|f )-'«< +E = -<‘n<i.p). 

that is, the exact differential of a function of the ^’s and p’s {—'H)- The 
integral (l.C.21b) is therefore a relative linear invariant of any Hamiltonian 
system: it is the well-known Poincare linear invariant. 



The Cartan Integral Invariant 



A generalization of the Poincare invariant is due to Cartan, who introduced 
the quantity 

=PiSqi-nSt. (1.C.30) 

Let us now consider the action integral (1.C.4) and more general variations 
than those considered before, that is, variations also implying changes at 
the extremes, for instance with the generalized coordinates qi, ^ 2 , ••• , qu) 
functions, besides the time t, of a parameter a. To any variation 5a of a 
there will correspond a variation of the action^^ integral: 



SI = 5 



rt2 2 f 

/ {PiQi -U)dt = iJs + \ qi- 

Jti 1 Jti K . 



m' 

dpi_ 



Spi - 



Pi 



m 

dqi 



Sqi^dt. 

(1.C.31) 



^^Cartan, op. cit. 
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Let us suppose, now, that we are considering a family of actual trajectories 
(i.e. corresponding to real motions of the system) depending on the parameter 
OL and that we limit ourselves to the interval t\^t 2 (also dependent on a). The 
variation of the action integral along these trajectories, owing to (1.C.31), will 
be 



= t=u 



If we refer to a closed tube of trajectories, i.e. to a family of trajectories 
closed in on itself (see Fig. 1.17), with every trajectory limited to a time 
interval (^ 1 ,^ 2 ), the total variation of the action integral when one goes back 
again to the initial trajectory (going along a closed curve around the tube) 
obviously vanishes. Therefore, if we integrate over a, we have 



and then 



j< - 

I I 




i — T-L6t) = const. 



(1.C.32) 



That is, given any tube of actual trajectories, the integral f cjs evaluated 
along a closed curve winding round the tube does not depend on the curve 
itself but only on the tube, f cjs is called the Cartan integral invariant. If, in 
particular, we consider a closed curve consisting of points corresponding to 
the same t (simultaneous states), then f = f Pi^Qi = const. 




Fig. 1.16 




In this way, we recover the Poincare integral invariant, which always re- 
tains the same value if the closed curve along which it is evaluated is made to 
slide along the tube (obviously, always corresponding to simultaneous states). 
As we have seen, a Hamiltonian system admits the Cartan invariant (1.C.32) 
as its integral invariant. It can be conversely shown, that if a system admits 
the integral invariant (1.C.32), it is a Hamiltonian system. In Sect. 1.5, we 
showed that if one considers the time t as a Lagrangian coordinate (t = Qo), 

^^Cartan, op. cit. 
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the corresponding conjugate momentum is po — —H. Therefore, in this new 
phase space^^ with (2n + 2) dimensions, 

(a ^ 0, 1,2, . . . ,n). (1.C.33) 

In this extended phase space (the name is due to Lanczos)^^ the relation 
between the Cartan’s and Poincare’s invariants becomes more transparent. 

The Poincare Recurrence Theorem 

The integral invariants are important, as we have seen, when one considers 
instead of one particular trajectory, a whole family of possible trajectories. 
Their use, as we have recalled, was introduced by Poincare in his celebrated 
paper on the three-body problem; in that paper there also appeared for the 
first time the theorem which is today called recurrence theorem. The argu- 
ment which is decisive in the proof of this theorem is provided by the property 
of the phase space which has already been codified in Liouville’s theorem, i.e. 
the property of behaving like an incompressible fiuid; in other words, the 
phase fiow preserves the volume. Let us assume, now, that the motion of our 
system is confined to a bounded region of the phase space: let D be this re- 
gion. Poincare’s theorem states that if is a transformation of D into itself, 
that is continuous, bijective and volume preserving, then any neighbourhood 
U of every point of D contains a point x which comes back again to U after 
repeated applications of the transformation Let to be the initial instant 
of time and r an arbitrary interval t — to > 0; then consider the subsequent 
transformations • • • , which transform the neighbourhood U 

to ^ 2 tU, ... , ^rnrU^ etc. Then G U for some integer n. The 

theorem also entails a “stronger” version of itself. That is, not only does it 
happen that for almost every x m. U at least one element of the sequence 
. . . belongs to f/, but also that, for almost every x in C7, there 
are infinite values of n for which x e U. A rigorous demonstration of 
Poincare’s theorem needs the concepts of measure theory; we therefore refer 
the reader to specialized^^ textbooks. Of course, in such a context, instead of 
volume preserving transformations, one speaks of measure-preserving trans- 
formations {Lebesgue measure). 

It should be remarked, at this point, that this theorem which appears to 
belong “naturally” to the subjects we have dealt with so far, is in reality of a 
completely different nature and can be considered the starting point of a new 
way of studying dynamical systems. From the “classical” point of view, the 
problem is that of explicitly determining the configuration of the system at 

^^Called “space of the states and energy” by J. L. Synge (op. cit.). 

Lanczos, The Variational Principles of Mechanics (Toronto University Press, 
1970; Dover, 1986). 

^^See, for instance, P. R. Halmos: Lectures On Ergodic Theory (Chelsea, New York, 
1956 ) p. 10. 
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the time as a function of the time and the initial conditions at t = 0. This, 
as we know, is almost never possible; Poincare’s theorem, therefore, renounces 
any claim to deal with individual trajectories and instead considers the global 
properties of all trajectories. 



The Use of Integral Invariants 



Let us now see what relations exist between the integral invariants and the 
first integrals for the system (1.A.32). Let us first examine the linear invari- 
ants and the necessary and sufficient condition (1.C.23). If we have a first 
integral of the autonomous system (1.A.32) given by 

F(xi, X 2 , . . . , Xm) = const, (1.C.34) 



putting Vi = dF I dxi and substituting in (1.C.23), we obtain 



F 



q ;=1 



+ 



a=0 




d‘^F 

dxjdxi 



Xjdxi = 0 . 



That is, whatever F = const, and of class satisfies (1.C.23) identi- 

cally: if (1.C.34) is a first integral of our system, then J{dF/dxi)dxi is a 
linear invariant for the same system. Conversely, if J{dU /dxr)dxr (with 
JJ — JJ{xi^ X2, ... , Xm) and C^) is a linear integral invariant, then still using 
(1.C.23), 



dU ^ 

dXr '' a=0 J^\dXidXj 



d^U 

dxjdxi 



j dx i — 0 , 



from which {dU /dxr)Xr is a function not depending on the x’s. Therefore 
d\J jdi — K — const, and dlJ/dt — X = 0 immedately provides the first 
integral U — Kt = const. 

Let us now consider the integral invariants of order m and the condition 
(1.C.28). If we know two of these invariants for the system and the related 
multipliers Mi and M 2 , substituting in (1.C.28), we have 



from which 



and also 



^ + MidivX = 0, ^ + M 2 divX = 0, 

dt dt 

1 dM2 _ J_d^ 

M 2 dt Ml dt 

Ml fj_d^ _ J_d^\ _ ^ 

M 2 V Ml dt M 2 dt ) dt \ M 2 ) 



Therefore, if Mi and M 2 are multipliers for the autonomous system (1.A.32), 
then Ml/ M 2 is a first integral for the same system. 
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1.11 Poisson Brackets and Poisson’s Theorem — 

The Generation of New Integrals 



Let us now return to Hamilton’s equations and suppose that they have a first 

integral F — F{qi,q 2 , ... ,qn, Pi,P 2 ,Pn', t), also depending on time, to 

consider the most general case. Hence 

d^_dF_ f dFdH dF dn ' 

dt dt ^ dqk^^ ^ dpk^^ dt ^ \dqk dpk dpkdqk. 



= 0. (l.C.35a) 



The expression contained in brackets, which is the sum of n second-order 
Jacobian determinants, is called the Poisson bracket and enjoys some re- 
markable properties, which we shall now study. We rewrite (l.C.35a) as 



dt 



dF_ 

dt 



+ (F,H) -0, 



(l.C.35b) 



having put 



n 



{F,U) = Y, 

k=l 



d{F,n) 

d{qk,Pk)' 



In general, if u and v are two functions (which we shall assume are C^) 
of the g’s and p’s and of then 



(u,u) = ^ 

k=l 



d{u,v) 

d{qk,Pk) 



du dv du dv 

dqk dpk dpk dqk 



(l.C.36a) 



The time derivative of any function, evaluated along a solution of the system, 
will be written, for the Hamiltonian systems, as in (l.C.35b), and Hamilton’s 
equations themselves will become 



Qk^iqk.n), Pk = {Pk.n). (1.C.37) 



Owing to the definitions we have introduced in (1.C.7) and (1.C.8), (l.C.36a) 
can be written as _ 



The following properties are easily checked: 



(l.C.36b) 



(u,v) = -(u,w), 
(c, u) = 0, 



d . . f du 



(li, = 0 : antisymmetry; 
if c = const.; 
dv\ 



+ 






(1.C.38) 



but checking the so-called Jacobi identity is a little more arduous 
(u, {v, w)) + (u, (w, u)) + (w, (u, v)) = 0, 
where u, v, iv are three functions of class C^. 



(1.C.39) 
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Owing to (l.C.35b), if Fi{q, p, t) and F 2 {q, p, t) are two first integrals 
of Hamilton’s equations, 



^F^ 

-^ + (F.n) = o, 



dt 



+ (F2,'H) =0. 



(1.C.40) 



Let us now evaluate 

where we have used (1.C.40). Then, by using (1.C.38), 

-(fi,F2) + ((Fi,F2),1W) = (F2 ,(Fi,?{)) + (Fi,(?/,F2)) + (7{,(F2,Fi)) =0, 
owing to the Jacobi identity (1.C.39). Therefore we have obtained 

-(Fi,F2) + ((Fi,F2),'^) =0; 



that is, if Fi and F 2 are two first integrals, their Poisson bracket (^ 1 ,^ 2 ) 
is also a first integral {Poisson’s theorem). At first sight, Poisson’s theorem 
appears to be a concrete possibility of generating new first integrals starting 
from two that are already known and of generating them in such a number as 
to allow the integration of the system. As a matter of fact, all is not so rosy as 
one might think; first of all, since the Poisson bracket with the Hamiltonian 
of any first integral not containing the time vanishes, in this case we need two 
first integrals different from the total energy and mutually independent, and 
we are seldom able to satisfy such a condition. Secondly, in the case of an 
integral depending on time, (F^H) = —dFjdt will be the new integral, but 
for autonomous systems this result can be obtained directly; obviously the 
successive derivatives d'^Fjdt^^ d^F/dt^^ . . . are also first integrals; however, 
they are not necessarily mutually independent. 

Nevertheless, sometimes it happens that new integrals that do not depend 
on those already found can be generated. As an example of this, let us consider 
the planar oscillator which has already been studied. With pi = qi^ p 2 = Q 2 , 
the Hamiltonian of the isotropic planar oscillator will be 

^ = + (1.C.41) 

The energies of the two modes being separately conserved, we can choose 
their difference 

B (1.C.42) 

as an integral independent of H. Moreover, owing to the rotational symmetry, 
the angular momentum immediately gives another integral independent of B 
and B: 



L = qiP2 - Q2P1- 



(1.C.43) 
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Their Poisson bracket will be 

C = (L, B) = 2pip2 + ‘2lu;^qiq2 — 4 ^12 = 2uj^ab cos 6, 

where we refer to (1.B.75) for the definition of A 12 . Therefore C is really a 
new integral, independent of the preceding ones. If we go on to search for 
new integrals, we find 

(C,B) = Au'^{q2Pi - qiP2) = 

(C, L)^2 [{pj - pI) + uj‘^{q'^ - ql)] =4 5; 

that is, no further new integral can be found in this way. 

For Hamiltonian systems, the lowering of the order by means of first 
integrals goes in the same way as for Lagrangian systems: the reduced system 
is of the same type (Hamiltonian) as the original system. Therefore, rather 
than repeat that type of procedure, we shall deal with a separate case, i.e. 
the reduction which can be obtained by means of the energy integral. The 
procedure has been described as the elimination of the time and is important 
in the n-body problem in celestial mechanics. We have seen in the preceeding 
section, dealing with the Cartan invariant, how the differential form 

Pi dqi P2dq2 Pn dqn -Hdt (1.C.44) 

is associated with a Hamiltonian system and in what way the one determines 
the other. 

If we have an autonomous system, T~L will not contain the time explicitly, 
and then T-L = h = const., or also (with po = 

/i+Po = 0. (1.C.45) 

Solving (1.C.45) with regard to pi, we can write 

^ {P2,P3, ■ ■ ■ ,Pn\ 91,92, • • • ,9«; Po) +Pl = 0. (1.C.46) 

The differential form associated with the system is 

Pi dqi ^P2dq2^ ...+Pn dqn + Po dt, 

with the 2n + 2 variables 



Pi 5 P 2 Pn •> Qh Q 2 i • ' ' 5 Qn 5 PO 5 ^ 
satisfying (1.C.46). Therefore we can also write 

P2dq2 + Psdqs + . . • + Pndqn + Podt - /C(p2,Ps, . . . ,Pn; ^i, • • • , Qn] Po)dqi 

and consider 

P2,... ,Pn; ^1,^2,... ,^n; PO, t 

as the 2n + 1 variables. But the system corresponding to this form will be 
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dqi die 
dqi dpi ’ 
dt dK 
dqi dpo ’ 



dpi 

dqi 

dpo 

dqi 



dqi' 

■— =0 (i = 2,3, ... ,n). 



(1.C.47) 



The last two equations are separate from the other 2n — 2, since these do 
not contain and po is constant. Therefore the outcome is that, from a 
Hamiltonian system with n degrees of freedom, we have passed, eliminating 
time, to another one with n — 1 degrees of freedom: 



dqi die dpi die 

dqi dpi ’ dqi dqi 



(1.C.48) 



So that the procedure is advantageous, K must be a one-valued function 
for the correspondence between the values of qi (which plays the role of a 
parameter) and the points of the trajectory in the configuration space to be 
one to one. Clearly, the system (1.C.48) is a system which determines the 
trajectory but does not tell us when the moving point is at a certain position 
on the trajectory itself. This must be deduced from the last two equations of 
(1.C.47). 



1.12 Canonical Transformations 



Obviously, the integration of the 2n canonical equations (1.C.3) represents 
the central problem of the Hamiltonian theory; since general rules do not 
exist which can be applied to integrate (1.C.3), it is convenient to investigate 
if first integrals exist (which enable us to lower the order of the system) or 
to proceed in performing variable transformations which make the system 
simpler and then easier to be integrated. It is clear that the second issue 
implies the first, as we have already seen speaking about transformations of 
variables which make one or more variables ignorable. An extreme case is 
when 1~L depends only on half of the canonical variables (and all of the same 
type) and not on the time; suppose that the variables in question are the 
Pfc’s: T-L — T-L{pi, P 2 , • • • , Pn)- We have immediately 



on ^ . dn 

Pi = =0, q^ = — = const. = 

dqi dpi 



that is. 



qi = (Jit Si^ Pi — const. = a^, Vi, 



where and 8i are integration constants. Therefore, in this way, (1.C.3) are 
immediately integrated. We shall deal with this case, which is of great impor- 
tance in applications, in a subsequent section. If we now consider, in general, 
the problem of “simplifying” the system (1.C.3) by means of a variable trans- 
formation, the first question is of what type must the transformation be to 
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it be meaningful. It seems quite natural to impose the condition that the new 
variables must also be canonical so that one can write new equations in place 
of (1.C.3) which in turn are canonical. 

According to a convention introduced by Whittaker, we shall call Qi^Pi 
the new canonical variables and JC the new Hamiltonian: 

Qi = Qi{qi,q2,--- ,qn, pi,P2,-- - ,p«; t), 

Pi = Pi{qi,q2,--- ,qn; pi,P2,--- ,Pn\ t). 

The condition which must be satisfied, so that Qz, Pi may be canonical vari- 
ables, is obtained by again imposing the stationarity of the action integral 
(1.C.4), but with the integrand now given by 



Y^PkQk-lC (1.C.50) 

k=l 

instead of (l.C. 2 b). A necessary and sufficient condition for having this is 
that the two integrands (l.C. 2 b) and (1.C.50) differ from one another by the 
total time derivative of a function W depending on 2n among the new and old 
variables and the time.^"^ In fact, in this way, d W will be an exact differential, 
its integral will depend only on the end points and not on the path of inte- 
gration, and i^ W — W{qi,q 2 , ... , ^n; Qi, Q 2 , • • • , Qn] t) hs variation will 
vanish at the end points. Therefore, one must look for an arbitrary function 
W = W(^i, Qi, Q 25 • • • 7 Qni i) which satisfies the condition 

J2Pkqk-n{qi,Pi,t) = (i.c.51) 

k k 

that is, 

Pk dqk - H{qi,Pi,t)dt = '^PkdQk ~ K.{Qi, Pi,t)dt + dW{qi,Qi,t) 

k k 

^ ■5-^ /dW dW \ dW 

= ^ PkdQk - )C{Qk, Pk,t)dt + ^ dqk + ^ dQkJ + ~^dt. 

For (1.C.51) to be satisfied, it must be the case that: 

dW{qi,Qi,t) ^ _ dW{qi,Qi,t) , dW{qi,Qi,t) 

Pk r\ 5 ^k "> ^ ' Qi 

oqk oQk dt 

(1.C.52) 



By inverting the second group of equations in (1.C.52), we can obtain 
qk = qk{Qi,Pi,t), and, by substituting in the first group, pk = Pk{Qi,Pi,t); 

a rule, the function W must depend on (4n + 1) variables, i.e. on q, p, Q, P and 
the time; since (1.C.49) provide 2n relations among them, the actual dependence 
is on 2n canonical variables and the time. 
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therefore (1.C.52) constitute the 2n equations of transformation. For the 
sketched procedure to be possible, the Pk must satisfy the conditions re- 
quired by the implicit function theorem, i.e. 



d^W{quQ,,t) 

dQk dqi 



(1.C.53) 



Obviously, this condition also guarantees the existence of the inverse trans- 
formation. In fact, starting now from the first of (1.C.52) we can obtain the 
Qi^s as Qi{qk^Pk^t)i with the Jacobian (1.C.53) ^ 0, and, by substituting in 
the second one, have the F^’s as Pi{qk^Pk, t)- Moreover, it should be remarked 
that W — W{qi,Qi,t) is not defined on a set belonging to the phase space but 
on a set of the Cartesian product of the two configuration spaces, x Rq. 
As a further remark, note that (1.C.51) or (1.C.52) does not define the most 
general canonical transformation; in fact, the most general form is obtained 
with the left-hand side given by c {J2kPk —Hdt), where c is an arbitrary 
constant. Usually, only the case c = 1 is taken into account. The function W 
is called the generating function of the canonical transformation: in fact, it is 
enough to know only this function (instead of 2n functions qi — qi{Qk, Pk^t) 
and Pi = Pi{Qk^ Pk^t)) to be able to write by means of (1.C.52) the new 
canonical equations. 

The condition (1.C.51) refers to the most general canonical transformation 
in which the new coordinates and the new conjugate momenta depend, not 
only on the old canonical variables, but also on the time; if we consider it for 
a fixed value t of the variable t, it will be 



J^Pkdqk = J2PkdQk+dW{q,Q,i). (1.C.54) 

k k 



Equation (1.C.54) is therefore the condition which must be satisfied for a 
transformation independent of time to be canonical. Conversely, if we consider 
(1.C.54) for various given values of the parameter t (and so for every value 
of t the transformation is canonical) and consider the function JC defined by 

dW 

/C^F + -^, (1.C.55) 

then, summing (1.C.54) and (1.C.55), we again get the condition (1.C.51). 
This shows that the necessary and sufficient condition for a transformation 
depending on time to be canonical is that all the transformations not de- 
pending on time, obtained by (1.C.51) by replacing t by an arbitrary ?, are 
canonical. Therefore, to set up rules for deciding if a given transformation 
is canonical, we can limit ourselves to considering transformations not ex- 
plicitly dependent on time. Therefore, it is enough to verify if the difference 
Xli Pi ^Qi ~ Pi ^Qi exact differential. In other words, the necessary 
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and sufficient condition for (1.C.49) to represent^^ a canonical transforma- 
tion consists in the invariance of the integral f Pi ^Qi evaluated along any 
closed curve, that is, 





Pi dQi = const. 



(1.C.56) 



The question now is to find for (1.C.56) equivalent formulations which are 
more directly applicable in actual cases. If we consider the integral f ^ • pi dqi 
along any closed curve in the 2n-dimensional phase space, we have, obviously. 




^ S Pi dqi. 

i=i 



Consider then, for any given i, the line integral f pi dqi along the closed 
curve resulting from the projection on the qiPi plane of the curve in the 2n- 
dimensional phase space. It represents, as we know, the oriented area bounded 
by the curve itself and therefore, if we introduce a system of orthogonal 
curvilinear coordinates {ui^Vi) in the qiPi plane, it will also be 



^ Pi dqi = j 



dpi dq, 



= f ( 

Jsi \dui dvi 



dpi dqi 
dui dvi 



dui dvi , 



where the term within the brackets is the Jacobian determinant of the trans- 
formation (qi^Pi) {ui^Vi). Denoting by S the increment that the canonical 
variables undergo when only v varies and by d that due to the variation of 
u only, the last integrand can also be written {5pi dqi — dpi 5qi) and then in 
addition 



^ / dqi dpi 

^ \dui dvi 
1=1 



dpi dqi 

dui dvi 



dui dvi 



n 

{dpi dqi - dpi Sqi). 

i=l 



(1.C.57) 



The right-hand side in (1.C.57) represents a bilinear differential form to which 
we shall very soon return. By repeating our procedure backwards, the integral 
becomes 






dqi dpi 
dui dvi 

dqi dpi 
dui dvi 



dpi dqi 
dui dvi 



dui dvi 



dpi dqi 
dui dvi 



dULi dVi . 



(1.C.58) 



The integral in the left-hand side must remain invariant under the transfor- 
mation {q^p) (Q^P)^ whatever the closed curve P (and then the surface 

^^These arguments have a local validity, i.e. “in small” , and leave the topology of 
the phase space out of consideration. 
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S having F as boundary) may be; from this the invariance of the integrand 
follows and therefore also the invariance of 



[Ui^ Vi 




dqi dpi 
dui dvi 



dpi dqj 
dui dvi 



(1.C.59) 



where we have introduced a new symbol called the Lagrange bracket of Ui 
and Vi^ which enjoys the property 



[Ui, Vi] = -[Ui, Vi]. 



(1.C.60) 



From what was said above, it follows that this is equivalent to the invariance 
of the bilinear differential form 

n 

Y^{Spidqi - dpiSqi). (1.C.61) 

i=l 

In Lagrangian dynamics, as we have seen, it appeared quite natural to define, 
by means of the kinetic energy, a distance and then the ds‘^ of a Riemannian 
metric in the configuration space. Equation (1.C.61) shows that, in the phase 
space of Hamiltonian dynamics, the invariant quantity is no longer the square 
of a distance, but an area. 

For establishing if a transformation is canonical or not, therefore we have 
now the rule: 

The transformations from the variables {qi,Pi) to {Qi^Pi) are canonical 
if they leave the Lagrange bracket (1.C.59) invariant, whatever the depen- 
dence of the g^’s, Pi’s, on u and v may be. 

As the parameters u and v are quite arbitrary, we can choose for them any 
pair Qi,Qk or Qi,Pk or Pi,Pk of the new variables and then construct the 
Lagrange brackets. If we do the same in the new variables, all the Q^’s and 
PiS in this case being mutually independent, we obtain 

[Qi. Qk] = 0, [Pi, Pk] = 0, [Qi, Pk] = (1.C.62) 

Therefore (1.C.62) provides a necessary and sufficient condition for a trans- 
formation {qi,Pi) {Qi,Pi) to be canonical. When the canonical transfor- 
mation depends explicitly on the time, the conditions (1.C.62) must hold true 
for any t. 

Now, exploiting the matrix form of the canonical equations and the prop- 
erties of the matrix J we have already brought in (see Sect. 1.9), we will 
obtain another equivalent condition to characterize a canonical transforma- 
tion. From (l.C.6-8), it is evident that 







dz 

du 



T 



(1.C.63) 
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where we have denoted by [u^v]^ the Lagrange bracket related to the old 
variables. Denoting by [u.v]^; — {dZ jdu^ J{dZ jdv) that related to the 
new ones, we must have, owing to the invariance. 



But if we call M — {dZ jdz) the Jacobian matrix of the transformation, then 



dZ\ ^Jdz 
du)~ ^\du 






Therefore 






and, hence, the invariance is verified if the Jacobian matrix obeys the condi- 
tion 

M^JM = J. (1.C.64) 

A matrix satisfying (1.C.64) is called symplectic. Consequently, we can ex- 
press the necessary and sufficient condition for a transformation to be a 
canonical one if and only if the Jacobian matrix of the transformation is 
symplectic. Reconsider now the definition (l.C.36b) of the Poisson brack- 
ets and exploit the matrix formalism to see what the relationship is between 
{u, v) and the Lagrange bracket [u, v]. To do this, it is convenient to introduce 
two m X m, with m = 2n, square matrices: 



u, u] — 


/ [ui,Ui] . 

[U2,ui] . 


to 


, {u,u) = 


/ {ui,Ui) . 

{U2,Ui) . 


{u\ , UjY \ 

{U2i Um) 




' ^1] 






' i,^mt ^1) • ' 


' • Ymj '^m) ' 



which we call the Lagrange matrix and Poisson matrix respectively and 
which have as a generic element, whose position in the array is labelled by 
i and J, the corresponding bracket between Ui and Uj. The 2n functions 
Til, • • • , Um must be considered arbitrary functions of zi, 22 , . . . , 2;m. If 
we evaluate the product [u^u] we obtain 



[u, u]{u, uY 



ou } \auj\dzj \dz 




(l.C.65a) 



where we have used (l.C.ll) and the properties of the matrix product. Equa- 
tion (l.C.65a) can be rewritten as 
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m 

'^[ui,Uk]{uj,Uk) = Sij {i,j = 1,2, ... ,m = 2n). (l.C.65b) 

fc=i 

Therefore, between the Poisson and Lagrange brackets a reciprocity rela- 
tionship exists: this means that the invariance of Lagrange brackets entails 
the invariance of Poisson brackets and vice versa. That is, (1.C.62) has as a 
consequence that 

{Qu Qk) = 0, {Pi, Pk) - 0, {Qi, Pk) - (5,;, (1.C.66) 

and vice versa. 



1.13 Generating Functions — 

Infinitesimal Canonical Transformations 

In (1.C.51) and (1.C.52) in Sect. 1.12 we have chosen for the function W the 
form W = W{qi,Qi,t), that is we have chosen the ^^’s and Q^’s (among the 
4n variables available) as the 2n independent variables; canonical transforma- 
tions of this type are called free canonical transformations, thus emphasizing 
that the Qfs can be fixed independently of the g^’s. It is clear that a transfor- 
mation like the identical transformation Qi = qi {i = 1, . . . , n) could not be 
included among the transformations of this type; therefore the problem exists 
of listing all the possible classes of generating functions. Since there are four 
groups of variables, {qi,Pi, Qi, Pi), we want to pair, in such a way to have in 
each pair a “new” group and an “old” one, there are also four possibilities: 

Wi{qi,Qi,t), W2{qi,Pi,t), W^{pi,Qi,t), W^{pi,Pi,t). (1.C.67) 

The generating function so far employed in (l.C. 51-53) in Sect. 1.12 is 
W\{qi,Qi,t). If we start again from (1.C.51), we can write 

'^^Pkdqk — Hdi = ^ PkdQk ~ K^dt + dWi 

k k 

— — QkdPk — l^dt -|- d{Wi + QkPk)^ 

k k 



from which 

'^Pkdqk + X] QkdPk + (/C - 'H)dt = d{Wi + ^ QkPk)- 

k k k 

If we now define 



W2{qi,Pi,t) =Wi{qi,Qi,t) + '^QkPk, (1.C.68) 

k 



we have 
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Y^Pkdqk + ^ QkdPk + {1C- H)dt 

k k 

_ dW2 ^ ^ dW2 ^ ^ I Y^ dW2 



dPk 



(1.C.69) 



and then 



_dW2 ^ r^_nj^dW 2 

dqk ’ dPk' dt ' 



(1.C.70) 



Equation (1.C.68) is a Legendre transformation (see the definition in 
Sect. 1.9) which, together with (1.C.70), defines the canonical transformation 
generated by W2{qi^ Pi^t). The above-mentioned identical transformation be- 
longs to this class. In fact, if we consider 



W2{qi,Pi,t) = Y,diPu (1-C.71) 



from (1.C.70) we have 



Pi — Pi', Qi — Qi', — T~i' 



By means of analogous Legendre transformations, we can obtain 



W^{pi,Qi,t) = Wi{qi,Qi,t) - '^Pkqk, 

k 

^^4(^27 Pi")^) ~ {qi-i Qi^ ^ ^ QkPk ^ ^^ Pkqk' 

k k 



(1.C.72) 



We summarize in the table below relations (1.C.52), (1.C.70) and those re- 
garding Ws and W4 defined in (1.C.72). The three determinants, analogous 
to (1.C.53), will never vanish. 



Generating functions 


qi 


Pi 


Qi 


Pi 


K-n 


Wi{qi,Qi,t) 




dWr 

dqi 




dWx 

dQi 


dWx 

dt 


W2{qi,Pi,t) 




dW2 

dqi 


dW2 

dPi 




dW2 

dt 


W3{pi,Qi,t) 


dW^ 

dpi 






dW:^ 

dQi 


dW^ 

dt 


Wi{pi,Pi,t) 


dpi 




dW4 

dPi 




dW4 

dt 



It is evident that it is possible to generate the identical transformation also 
by Ws (jpi,Qi,t) — - ^j^PkQk, from which, in fact, one obtains 



qk — Qk-, Pk — Pk 



ic = n. 
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Other bilinear generating functions are 

w, = J2<ikQk, 

k k 

which exchange the q's for the p's in an asymmetric way: 

Qi — Pi-) Pi — Qi’ 

A worthy particular case is when the transformation is merely a “point trans- 
formation”, i.e. when the new coordinates Qi are functions only of the g^’s, 
without the presence of the conjugate momenta: this can be obtained by 
means of transformations of the type W 2 = Ylk=i fk{Qj)t)Pk) which give 

Qi — fi{Qj ? 0 * 



Infinitesimal Canonical Transformations 



Consider now a transformation “close” to the identical one, i.e. which differs 
from it by an infinitesimal amount, in the sense that the variation of the 
Qi's and the pi's is infinitely small. Its generating function will then differ 
infinitely little from the one of the identical transformation. 

Therefore we shall write it in the form 

J2QiPi+^W2{qi,Pi), (1.C.73) 



where £ is a first-order infinitesimal and 14^2 is a differentiable function. By 
applying (1.C.70), we obtain 



Pi — Pi £ 



dW2 

dqi 



n - ^ ^^2 



(1.C.74) 



Equations (1.C.74) define an “infinitesimal canonical transformation” de- 
pending on the parameter s; neglecting second-order terms, from these equa- 
tions we also obtain 



Qi — qi + £ 



dW2 

dpi 



If we now rewrite (1.C.75) as 



Qi-qi = Sqi = e 



dW2 

dpi 



Pi = Pi -e 



dW2 

dqi 



(1.C.75) 



Pi~Pi = Spi = -e 



dW2 

dqi 



and consider the case in which s = 5t and W 2 = H, we have 
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that is, we recover the canonical equations written in terms of differentials. 
This means that the infinitesimal canonical transformation having -h 

6tl~L as a generating function (and therefore the infinitesimal time 5t as a 
parameter) is given by the evolution of the system in the infinitesimal time 
St. 

It can be shown that, by performing several infinitesimal canonical trans- 
formations, one again has a canonical transformation and hence, at last, the 
conclusion that a finite motion of our system consists of a sequence of an in- 
finite number of infinitesimal transformations generated by the Hamiltonian. 



1.14 The Extended Phase Space 

When defining the phase space for a system with n degrees of freedom, one 
considers the conjugate canonical variables (qk^Pk) as dynamical variables 
and the time t as an independent variable. However, sometimes it is conve- 
nient (and in certain cases necessary on principle, as for instance in the theory 
of relativity) to consider also the variable t as a dynamical variable, and then 
one describes the evolution of the system by means of the variation of a pa- 
rameter re, instead of the time. We have already treated, in part, the problem 
by introducing in Lagrangian dynamics the space of events (see Sect. 1.5) and 
the extended phase space for Hamiltonian dynamics (Sect. 1.10). We want to 
discuss just the extended phase space here in more detail. As we have seen, 
if the time t were considered a coordinate Qq, its conjugate momentum would 
be given by po = —'H, and one would have to handle a 2 n + 2 -dimensional 
space: the extended phase space. Therefore, the coordinates would be given 
by qo = t, qi, ^ 2 , • • • , and the momenta by po = ~'H, pi, P 2 , • • • , Pn- For 
greater clarity, it is convenient to use different symbols: 

xq = t, Xi^ qi, yo = -n, Vi = Pi (1.C.76) 

and Greek letters for the indexes running from zero to n; in this way the 
extended phase space will have coordinates Xg (with p = 0 , 1 , 2 , . . . , n) and 
momenta yg (with p = 0 , 1 , 2 , . . . , n). 

The action integral becomes 

T — r ( x'l x'2 ^ / 7 

Jw, V ^0 ^0 ^ 0 / 

where we have replaced the dependence on t by the dependence on a param- 
eter w: t = xo — xq{w), Xi — Xi{t{w)), etc. (see (1.B.30) for the method 
used). We have, in this case, as Lagrangian 

A r f r [ ^2 \ / 

A = Cxq = C[Xq,Xi,... ,Xn] ,-7 Xq. 

V ^0 ^0 ^ 0 / 

This is a homogeneous function of first degree in the n+1 variables Xq, , . . . , 
x'^ and, owing to Euler’s theorem, we also have 
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n p. . n 

^=0 e ^-0 

As a consequence, the corresponding Hamiltonian, which we shall call /C, will 
be equal to zero: 

n 

K. = '^ygx'g- A = a. 
q=Q 

The action integral, therefore, will be reduced to the completely symmetric 
form: 

pW2 rW2 / ^ \ ru)2 ^ 

1= Adw^ / i'^ygx'g-IC\dw= ^ygx'gdw, (1.C.77) 

Jwi Jwi \g=0 J -'^1 ^=0 



which does not contain any Hamiltonian. In the integral (1.C.77), which must 
be considered evaluated along a curve parametrized by u;, the ygS must be 
given in conformity with yo (xq, xi, . . . , x^; yi, 2 / 2 , • • • , Vn) = 0 (from 
(1.C.76)), which it may be convenient to replace by the more general equation 
of energy 

-r(x„2/J = 0, (1.C.78) 

instead of making one of the variables explicit. Then (1.C.78) is a relationship 
among the 2n + 2 variables xq, xi, . . . , x^, 2/o, 2/i, • • • 2/n and, if we want to 
deduce the equations of motion in the extended phase space from a variation 
of (1.C.77), in the same way as is done by (1.C.4), (1.C.5), we must take into 
account the condition (1.C.78). Therefore, using the method of Lagrange 
multipliers, we shall perform the variation of the integral 




The multiplier A will be a function of w which, with a suitable choice of w 
itself, can be put equal to 1. We have therefore to perform the variation of 




which yields 



SI = 




+ 




dr 

dyg 



hi 



n 

dw = ygSxg 

Q=0 



W2 



Wi 



dr ^ dr ^ 





dx 



dyg 



dw = 0, 



®®See J.L. Synge: Handbuch der Physik, III, 1 (Springer, 1960), Sects. 67, 68. 
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upon integrating by parts in the usual way. By imposing the condition that 
the end points of the curve remain fixed, we finally have, from Hamilton’s 
principle for the extended phase space, the canonical equations 



dxn 



dw 



dVe 

dw 



dr 

dxn 



(^ = 0,1,2,... ,n). 



(l.C.79a) 



dr 

dy Q ujuu 

where r plays the role of Hamiltonian; we call it the extended Hamiltonian. 
If we make the particular choice 



r=Po+H(go,^l,... ,9n; Pi,... ,Pn), 



(l.C.79a) give us 



d^ _ ^ d^_ __ _dn 

dw dpi dpi ’ dw dqi dqi ’ 
which are the usual canonic equations, and 



dqo 

dw 

dpo 

dw 



dr ^ 

— — = l^t = qo = w-{- const., 
opo 



dr _ _m m_m 

dqo dqo dt dt ' 



(l.C.79b) 



(l.C.79c) 



which coincides with (1.C.13). This means that the choice made for the mul- 
tiplier A corresponds to having chosen w = qo (the arbitrary constant can be 
put to zero) and that now (1.C.13) must be integrant part of the equations 
of motions in parametric form, with po one of the independent variables. 
Through (l.C.79b), therefore, we have verified the complete equivalence of 
the two formulations, in both ordinary phase space and extended phase space. 
It is worth remarking that since r is always independent of the parameter tc, 
in extended phase space every system is a conservative system. As to canoni- 
cal transformations in extended phase space, the starting point will of course 
be the variational principle: if one passes to new variables Xg^Yg^ 



= dW (1.C.80) 

Q 



will be the fundamental relation for the generating function of the transfor- 
mations in the 2n + 2-dimensional extended phase space. The function VF, 
which a priori could depend on 4n + 5 variables {xg.pg, Xg,Yg,w), owing 
to the 2n + 2 relations among the canonical variables, will depend only on 
2n + 3 independent variables, and therefore there will still be four possible 
classes of generating functions. The table of the generating functions, already 
established for the 2n-dimensional phase space, can still be used replacing t 
by w and taking into account now that z ^ = 0, 1, 2, . . . , n. A different and 
particularly important case occurs when (in the extended phase space) one 
wants to transform only the parameter t (as a function of coordinates) leav- 
ing the canonical variables unchanged. We shall dwell upon a transformation 
of this type in more detail since, besides the interest of the subject per se, it 
is also important for applications. 
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When varying /, we chose A = 1 and then ic = it is now obvious that 
if we want to transform the variable we have to take into account that 
the parameter w will also change. But a change in w entails a change in 
the condition (1.C.78), to which the choice for w is strictly tied. In fact, the 
elementary action is given by 



= Vedxe = ^ Ve^ dw = '^ 
e=o e=o e=o 



and this determines dw. The same relationship among the 2n + 2 variables 
Xg^i/g^ however, can be expressed by means of different equations and, if we 
pass from T = 0 to T* = 0, both expressing the same relationship, the 
corresponding parameters will satisfy 

dw^ dr 
dw dr^ 



Let us verify this directly. Suppose then we have a system with n degrees 
of freedom described by a Hamiltonian that does not depend on time. In 
extended phase space, we have 



Qk = 



dr 

dpk 



Pk = - 



dr 

dqk 



(1.C.81) 



with r = po + U (^ 1 , 92 , • • • Pi, • • • ,Pn) and «; = f. If we now want to 
pass from the variable t to T by means of 



dt = f{qi,q 2 ,--- ,qn)dT, (1.C.82) 



with / an arbitrary function of g^’s (having the required properties of regu- 
larity) , as a consequence T will be equal to the new parameter ui* and 

C* = /(gi,g2,... ,9n)r (1.C.83) 



as well. In fact, the corresponding Hamiltonian system will be 



dqo 


dr* 


dpo 


dr* 


dT 


dpo ’ 


dT ~ 


dqo ’ 


dqi 


dr* 


dpi 


dr* 


dT 


dpi ’ 


dT ~ 


dqi ’ 



(1.C.84) 

= 1,2,... ,n) 



with r* ^ fr = flpo+H (gi, 92, ■ ■ • ,qn, Pi,P 2 , ■ • ■ ,Pn)]- The first equation 
will give dqo/dT — /, and then (1.C.82), and the second one 



dpo ,dpo „ ^ , 
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From the remaining equations, we obtain 



dqj ^ , dqi 
dT~ ^ dt 

dpi _ dpi 

dT~ ^ dt 



dpi dt dpi' 



djfn 

dqi 



df , .dH 

dqi dqi 




that is 

dpi _ dH 
dt dqi ’ 

since — 0 along a solution. Therefore the system (1.C.84) is completely 

equivalent to the system (1.C.81), as asserted. The considered transformation, 
then, has the effect of transforming the time according to (1.C.82) and the 
Hamiltonian according to (1.C.83), leaving the canonical variables invariant. 
It must be stressed that what we have stated has to be thought as operating 
in extended phase space, i.e. the transformation (1.C.82) cannot be carried 
out in ordinary phase space leaving the canonical character of the equations 
of motion invariant. To be clearer, let us demonstrate this assertion in detail. 
Suppose then we apply (1.C.82) in (1.C.3). Then qi = q'Jf, pi == p'//, where 
we have denoted by the prime the derivative with respect to T, and then 



q'.=fiqu....qn)^. 



(1.C.85) 



For (1.C.85) to remain canonical equations, a Hamiltonian H must exist such 
that _ ^ 

dpi dpi ’ dqi dqi ’ 

Differentiating, we get 

dqjY dpi) dqjdpi dpidqj dpi\ dqj) 

and then 

df dH _ df dH 

dqj dpi dpi dqj' 

However, owing to (1.C.82), one has 




moreover, since it must be the case that (at least for some i) dHjdpi ^ 0, 
it is also necessary that df /dqi — 0. Therefore (1.C.82), in ordinary phase 
space, is admitted only for / = const; and indeed one can show the same 
thing starting from a function /(pi, . . . ,Pn)- 




90 



1. Dynamics and Dynamical Systems - C. Hamiltonian Dynamics 



1.15 The Hamilton- Jacobi Equation 
and the Problem of Separability 



Given a system with n degrees of freedom, let , ^ 2 , • • • , 9n; Pi , • • • , Pn; 

be its Hamiltonian (which we assume, in the most general case, also depends 
on time) and (1.C.3) the corresponding canonical equations. Furthermore, let 
W\{qi^ Qi^t) be the generating function of a canonical transformation into the 
new variables Qi^ Pi. Then 



Pi = 



dWi 
dqi ’ 



Pi = - 



dWi 

dQi' 



/C = W + 



dWi 
dt ' 



(1.C.86) 



Now, if the new Hamiltonian JC is zero, the new set of canonical variables will 
consist of constants and the system will have been transformed, so to speak, 
into an equilibrium state. In fact, from tC{Qi^Pi) = 0, one obtains Qi — 0 
so that Qi = const., and Pi = 0 so that Pi = const. We call S{qi,Qi^t) the 
generating function Wi which gives this result. Therefore 



dS 

'H{qi,Pi,t) + -^ =/C = 0; 



(1.C.87) 



that is, the generating function S will have to satisfy the partial differential 
equation (Hamilton- Jacobi equation) 



By means of (1.C.88), the problem, which at the beginning required the 
solution of 2n differential equations, is changed into that of the solution of 
only one partial differential equation. Despite appearances, however, this does 
not lead to a simplification, since (in general) to solve a partial differential 
equation is one of the most complicated problems of analysis. Moreover, the 
problems that become explicitly solvable by means of (1.C.88) are, for the 
most part, the same as those that are solvable in another way. Therefore, 
(1.C.88) does not represent, so to speak, an easing of matters from the tech- 
nical point of view in the solution of problems; instead it consists of a method 
which, besides inspiring a deeper understanding of the dynamical systems, 
turns out to be particularly suitable for the study of theories such as celestial 
mechanics, quantum mechanics, etc. Let us now take care of the solution of 
(1.C.88). As is known, in the case of a first-order partial differential equation, 
a solution which contains as many arbitrary constants as are the independent 
variables is called a complete integral; since in (1.C.88) the unknown function 
S' always appears through its derivatives, a solution is determined up to an 
additive constant which can be omitted, being uninfluent ial in the canonical 
transformation. Furthermore, we recall that S, being the generating func- 
tion of a canonical transformation, must satisfy the condition (1.C.53). The 
method of solution of (1.C.88) will therefore consist of the following steps: 
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1) Obtaining a solution S containing n essential integration constants ai , . . . , 
an- Since one must have 5 = 5(^1, . . . , Qi, ... ,Qn,t) the constants 
will be the new canonical coordinates: Qi = ai. 

2) According to the second equation of (1.C.86), putting —dS/dai — Pi = 
(3i, where j3i are n new constants. In fact, since /C = 0, then also Pi = 
—dKjdQi — 0 and Pi = const. 

3) Inverting these relations to obtain the Qi = fi{ai,a 2 , - - - , /?i, /? 2 , • • • , 

(3n; t) which, together with pi = dS/dqi, will provide the initial canonical 
variables as functions of the time t and 2n integration constants A- 

In this way we have fulfilled the aim: to have the general solution of the 2n 
differential equations (1.C.3). Obviously, the complete integrals of (1.C.88) 
which generate a canonical transformation that satisfies the required condi- 
tions, i.e. Qi — ai, Pi = Pi constant, and /C = 0, are infinite in number. 
However, one among them is particularly interesting and suitable for clarify- 
ing the physical meaning of the transformation itself: that one in which the 
double n-tuple of the ai, Pi consists of the set of the initial values of the Qi,Pi, 
at a time taken as the origin of time, in which the Hamiltonian is zero. 

That is, one considers the motion of the system as consisting of a canonical 
transformation. In fact, if we evaluate dS/dt, we obtain 



dt 



and from this 



^dS . ^ dS - dS 

dP' agP' ^ dt ~ 



n 

Y^PiQi-'H 

2=1 



n 

dS = '^^Pidqi — Tidt, 

2=1 




dt = f Cdt, 
Jto 



that is, the action of the system. If the Hamiltonian 1~L does not explicitly 
contain the time (this is nearly always the case in the problems of celestial 
mechanics) one obtains a considerable simplification. In this case we shall 
define a generating function 



5 = S'^{qi,q 2 ,... ,qn; ai,... ,a^) - ap, (1.C.89) 

where is independent of time. By substituting (1.C.89) in (1.C.88), we 
obtain _ _ 

^ / dS\ dS ^ / dS^\ 

where it is evident that one of the constants, a\ , turns out to be equal to h 
(the constant corresponding to the energy integral). Furthermore 



(ii = 

A = 



dS 

dui 

dS 

da\ 



da, ^ 

dS* _ dS* 
dai ^ dh 



dS* 



Pi 



dqi 
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Equation (1.C.90) is sometimes called the stationary Hamilton- Jacobi equa- 
tion or, directly, the Hamilton- Jacobi equation tout court it being mistaken 
for (1.C.88). The function 5* in it is also called the reduced action. A calcu- 
lation, analogous to that performed above for (1.C.88), now gives 

pt n ^ti 

S*= '^piqidt = 2 / Tdt, 

2=1 

that is, the Maupertuis action (1.B.99). Regarding the solution of (1.C.90), 
as in the case of (1.C.88), general methods which can be applied do not exist. 
However, there is a particular class of problems, separable problems, for which 
it is possible to find a complete solution: in these cases the partial differential 
equation in n variables splits into n ordinary differential equations in one 
variable; they are then immediately integrable. The method of separation 
consists in the following: one tries to solve the partial differential equation by 
putting S'*" equal to a sum of functions, each depending on a single variable: 

5 * = Si (qi) -h S2 (92) + . . . -h S'* {qn)- 

There is also the case of partial separability, that is, the case in which only 
one, or more (but not all), coordinates are separable. For each of them, 

S* ^ Si {qi)-^S'" (^1,^2,... ,gn)- 

This resumes the old argument of ignorable coordinates, and consequently it 
is quite natural to ask oneself in what way it is possible a priori to know if 
a given problem is separable (completely separable) or not. From a general 
point of view, the question is still unsolved, but for a certain class of prob- 
lems we have general rules stated in a theorem due to Stackel (1893-1895) 
and further developed by Darboux, Levi-Civita, Burgatti, Dall’Acqua, Eisen- 
hart, etc. Referring the reader to Chap. 5 for a detailed study of the results 
concerning the motion of a mass point in a given potential, here we limit our- 
selves to mentioning the Stackel conditions. If one considers a natural system 
[T — T 2 ) and a system of orthogonal curvilinear coordinates, then 




Moreover, one assumes that the coefficients Ci and the potential U are func- 
tions of class of • • • ,qn in a domain of the configuration space Q 

and Ci > 0, Vi. The Stackel theorem states that the system is separable if and 
only if a regular nxn matrix exists (with element Uij depending only on the 
qi's) together with a column vector wj (with its components depending only 
on the qi's) such that 

n n 

^CiUij=S{, ^CiWi = U {j = 1,2,... ,n). 

i=l i=l 



(1.C.91) 
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As a consequence, if {vij) is the inverse matrix of {uij)^ one has 

n 

Ci = vii^ C/ = (1.C.92) 

2=1 

In actual cases, one can directly verify the separability hypothesis by 
substituting (1.C.91) in (1.C.90). If the operation meets with success, every 
conjugate momentum becomes 

^ ^ dSliq,) 

^ dqk dqk 

that is, a function of the sole coordinate qk, and the partial differential equa- 
tion splits into n ordinary differential equations. The dynamical system with 
n degrees of freedom can then be considered a superposition of n systems 
with only one degree of freedom. However, this result must not be inter- 
preted from the dynamical point of view, because the actual equations of 
motion are contained in 

<9 S'* <9 S'* 

-^=A, (1-C.93) 

which are not separated, since certain and also h in general will be present 
in more than one of the S^^s. Furthermore it must be emphasized that the 
separable nature of a problem is not so inherent in the “physical” features of 
the dynamical system considered as in the chosen system of canonical coor- 
dinates. It may happen that some problems are separable only in one system 
of coordinates and some other problems (for instance, the Kepler problem) 
are separable in more than one system of coordinates. Unfortunately, rules 
do not exist to find the “right” coordinate system. Let us suppose we have 
to handle a separable system and we have determined a complete integral of 
(1.C.90), 

S* = S*{qi,. . . ,q„-ai,a 2 , ■ ■ . ,a„), (1.C.94) 

where ai — h. Let us perform a point transformation on the constants 



ai =/i = /i(7i,72,-- - ,7«), 

0:2 = /2(71,72,--- ,7n), 

(1.C.95) 



0„ = /n(7l,72,-- - ,7n), 

where we intend such a transformation to be one to one and differentiable to 
the extent required. If we substitute (1.C.95) in (1.C.94), then 

s* = S* [qi,q 2 ,. . . ,q„; /i(7i,... ,7„),... ,/„(7i,--- , 7 «)]- 

By an abuse of notation, we shall go on to use the symbol 5* for the sake of 
simplicity and write 



s* = S*{qi,... ,5„;7i,--- ,7n)- 



(1.C.96) 




94 



1. Dynamics and Dynamical Systems - C. Hamiltonian Dynamics 



Now, 

= fihj) 

and ... • • • ^Tn) will be a complete integral of (1.C.90) of the 

type _ 

S{qi,'Yi,t) = S*{qi,'n) - 

As before in the case of ai = /i, a 2 , . . . , the 71 , 72 , •• • , 7 n can also be 
considered the new canonical coordinates Qi,... ,Qn* that is, we have a 
canonical transformation (for the sake of simplicity we still call 5* the gen- 
erating function) from the qi^Pi to the Qi^Pi (as we shall see only one half 
of them will be constant) which transforms the Hamiltonian 1~L into 

T-L = /l (71^72, • • • , 7 n) = /i(Qi,Q 2 , • • • ,Qn)- 

Instead of (1.C.90), we now have 

(i = l,2,... ,n). (1.C.97) 

A stationary Hamilton- Jacobi equation of the nature of (1.C.97), that is 
with the Hamiltonian depending only on one group of canonical variables 
(assumed to be constants), can also be obtained with the momenta Pi in 
place of the coordinates Qi, i.e. with H = 'H{Pi). In fact, it is sufficient to 
apply to the initial canonical variables {qi,Pi) a transformation generated by 
S{qi, Pi) = S'^iqi, Qi) -h Qi^i (which is of the W 2 type), demanding the 
constancy of the Pi's and proceeding as above. One ends up with 

H(^qi,^^=h{Pi). (1.C.98) 

The two equations (1.C.97) and (1.C.98) are completely equivalent; one 
will employ the one or the other according to the convenience of having to 
handle a generating function of type Wi or W 2 . Incidentally, we point out 
that the same operation can obviously also be performed for the Hamilton- 
Jacobi equation (1.C.88), by introducing a function S = S{qi,Pi,t) of type 
W 2 instead of S = S{qi,Qi,t). 

Looking at (1.C.97) and (1.C.98) it might appear that the problem has 
been complicated in comparison with (1.C.90), going from a Hamiltonian 
equal to only one constant to another one which (in general) will depend on 
n constants. As a matter of fact it is not like that because, since T~L depends 
on only one group of variables (for (1.C.98) the Pi's), the system remains 
explicitly integrable. In fact, from (1.C.98), 

dP 

Pi = 0 4=^ Pi = const.; Qi = — = const. = 7 ^ 4=^ Qi = -fit -h Si, 

(1.C.99) 



n 



dS*{qi,^i) 




qu 0 

dqi 


= 
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the Pi's being constant and then the canonical equations completely inte- 
grated. The advantage of a solution like (1.C.99), in comparison with the 
preceeding case with ai = /i, is the following: if the initial separation by 
means of the constants a\ = /i, ^ 2 , . . . , o^n induces one to represent the sys- 
tem by means of variables devoid of direct physical meaning, now, on the 
contrary, one has the possibility through P = P{Pi) of introducing variables 
which better reflect its intrinsic physical features. 

Moreover, it often happens that one wants to use the solution of a rela- 
tively simple problem (for instance, the unperturbed motion of two bodies) as 
the starting point for the solution of a more complex problem (for instance, 
perturbation theory). Therefore, it is necessary to choose the variables in such 
a way that the solution of the simplest problem is expressed in the simplest 
possible way. 



Application to the Case of the n-Dimensional Oscillator 



Let us see now, as an example of the application of the Hamilton- Jacobi 
theory, the case of the oscillator. Consider at first a generic anisotropic planar 
oscillator in rectangular coordinates. The Hamiltonian (for unit mass) will 
be 

^ = \{Pi +pI) + + ^IqI) (i.c.ioo) 

and the Hamilton- Jacobi equation (1.C.90) is 



1 

2 

We can put 



dqi ) \dq 2 J 



2n 






S^{quq2) = St{qi) + S^{q2) 



and obtain 



IfdStY 1 2 2 
2\dqJ 



+ 



(l.C.lOla) is satisfied by 



i/^V 



1 fdS:^Y 1 2 2 ' 
2 (^ 92 ) 



2\dq,J 

1 /ds:V 1 , 2 



= ai. 



tt 2 being a new constant. Since 



dS^ 

dqi 



dq2 



(l.C.lOl) 

(1.C.102) 



(l.C.lOla) 



(1.C.103) 



2 



>0, 
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from (1.C.103) one has 



(1.C.104) 

LU\ UJ\ UJ2 ^2 



From (1.C.102) and (1.C.103) 

S*{qi,q2) = j y^2o2 - ujlqldqi + J \j2{ai - 02 ) - (1.C.105) 

and from (1.C.93) 



da 

dS^ 

da2 



r/ 



dq2 



^^2(01 - 0:2) -<^292 
f dqi r 

J \/2a2 - w?<7? J 



= -/3i + t = t-to, 
dq2 



V'2a2 J \/2(ai - a2) - uj^ql 

Finally, from (1.C.106) and (1.C.107) 



(1.C.106) 
-/?2. (1.C.107) 



. , 1 . f ‘^ 2^2 

t — to = — arcsm 

^2 \y/2{ai-a2) 



and 



/?2 



1 

arcsm 

UJl 



y/2a2 



1 

H arcsm 

(j02 



^2Q2 



from which 



qi 

Q2 



U>1 



sin[o;i(f - to) - Wi/? 2 ], 



^/2(qi - 02 ) 
t02 



sin[u;2(f - fo)]- 



(1.C.108) 



The solutions (1.C.108) are the same as those already obtained in the 
frameworks of Newtonian and Lagrangian dynamics, and obviously ai — a 2 
and Q 2 represent the energies of the two modes of oscillation. It is clear from 
our procedure that the same method can be applied for an n-dimensional 
oscillator, with n however large. This is no longer the case if we give up 
the rectangular coordinates. Only two cases exist of separability in non- 
rectangular coordinates: the isotropic planar oscillator in polar coordinates 
and the anisotropic planar oscillator with uj 2 — 2o;i = 2lo in parabolic coor- 
dinates. We shall briefly discuss both of them, in view of future applications. 
For the isotropic oscillator, in polar coordinates 



1 

2 






dS* 

dr 



-h 



1 



ds^ 

d^ 



1 



2 2 

Lo r = cti. 



S^{r,^)^S:{r) + Sm^ 



(1.C.109) 

(l.C.llO) 
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from which 



d-d 






ds*y ( 4 '' 

dr ' 



,-*-2 2 
+ -0J r = ai, 



and then 



S*{r, d) = ■•& + j \j 2ai — — ^dr. 



(l.C.lll) 



From (l.C.lll), one obtains 



dS* 

da I 

dS* 



= t - to = 



-I 



dr 



\f^ 

-j3 = ^ f — 
da^ J 2 



1 . {ai — uu‘^r‘^) 

— arcsin 



2iUJ 






at — Lu^a: 



2^2 ’ 



dr 



\/2q!i 



1 . 

— d — - arcsin 



^2^2 ± 






2^2 



Lu^a, 






and the equation of the trajectory becomes 



ai - \/ol\- uS^ol\ sin[2(i? + /?)] 



(1.C.112) 



The constants ai and = Pi^ are respectively the total energy and the 
angular momentum. 

For the anisotropic oscillator with uj 2 — 2o;i = 2c«;, in parabolic coordi- 
nates (see Sect. 5.4) with A and [i defined by 



qi = \/%i, ^2 = ^(A-/i), A,/i>0, 



the Hamiltonian (l.C.lOO) becomes 

n = 2 

with 



APA+MP^ ^ ^ 1 ,2(A^+1*^) 
A “h /i 



-h -LO 

2 A + /X 



1 ■{x + n) 

The Hamilton- Jacobi equation is then 



1 . (A -h /i) 

Pm • 

4 pL 



dS^ 



+ d 



(dS^ 



(A -h /x) [ V 

and can be satisfied by putting 



V dji 



1 ,(A4 + m») 

+2“ ‘ITTT 



= «! 



dS\ 



1 



2A( j + 2 ^ ^ ~ ~ 2/x^ j + -a; /u — aifi — — 0 ! 2 , 



d5: 



*\ 2 



(1.C.113) 

(1.C.114) 

(1.C.115) 

(1.C.116) 
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where 0^2 is a new constant. By subtracting the two equations, we have for 
0!2 

i ai(A - /x), (1.C.117) 

where we have used 

dS* dS* 

dn 

The constant a 2 has been introduced as an integration constant, without any 
particular expedient; but if now, using (1.C.113) and (1.C.115), we rewrite 
(1.C.117) in rectangular coordinates, we realize that a 2 coincides with the 
first integral (1.B.94). Therefore, the introduction of parabolic coordinates 
has enabled us to obtain, in this case, the first integral in an automatic way, 
without any particular device. We shall see in Chap. 2 an analogous situation 
concerning the Laplace-Runge-Lenz vector for the Kepler problem. 



1.16 Action— Angle Variables 

We have already pointed out in (1.C.99) that, when the Hamiltonian of a 
conservative system depends only on n dynamical variables (and all of the 
same type: n coordinates or n momenta) and they are constants of the mo- 
tion, then the system is explicitly integrable. Now we will deal with a method 
that, when applicable, enables us to reach this condition. This method, con- 
cerning the bounded motions of conservative Hamiltonian systems, which was 
originally introduced in a problem of celestial mechanics (the motion of the 
Moon), was subsequently applied in several branches of physics, beginning 
with early quantum mechanics. For the sake of simplicity, we shall start from 
a system with only one degree of freedom. In this case the bounded motions 
of a conservative Hamiltonian system are periodic in time, and this can occur 
in two ways: 

1) different values of q correspond to different configurations of the system 
and both p and q are periodic functions of the time (and consequently of 
any variable being a linear function of time): after a period r, both p and 
q again take the initial values (libration); 

2) every time q increases by a given constant amount qo, the same configu- 
ration of the system is repeated: after a period r, p again take the same 
value, whereas q is increased by qo {rotation). 

Figures 1.18 and 1.19 give the phase curves for libration and rotation 
respectively. Figure 1.18 represents the phase curve of the oscillator (see 
below) . 

It may occur that the same system can have both types of motion; the 
typical case is provided by the pendulum: for a small amplitude one has 
libration, whereas if the energy is sufficient for a rotation in the vertical plane 
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one has rotation. In Fig. 1.21, are shown the phase curves of the pendulum. 
The closed curves (libration) are separated from the open ones (rotation) by 
a dashed line (separatrix) which is the boundary line of regions with phase 
curves having different behaviours. Later on we shall discuss the case of the 
pendulum in more detail. In general, one has libration when the system moves 
between two states of vanishing kinetic energy; in the case of rotation, it is 
always possible to choose the variables in such a way that q is an angle and 
to normalize them so that go = 27 t. Let us suppose we have found a solution 
of the Hamilton- Jacobi equation (1.C.98) for our system (i = 1, only one 
degree of freedom) and let a = P and j3 — Q he the new canonical variables. 
Then /? will be a linear function of the time: /? = ^{t — to). Therefore we can 
write in the two cases 

libration: g (/? + 7 r) = q[l3)^ 

rotation: q{P yr) = g (/?) + 27T. 

Now we perform a point transformation from a and (3 into two new variables 
J and d so that d = This can be accomplished by means of a 

generating function 

7T 

from which it also follows that a — 2r:J and then J = In the 

new variables we have 



libration: q{d 2tt) = q(i3), 

rotation: q{d 2tt) = g (t?) + 2tt. 



(1.C.118) 



The comprehensive canonical transformation, from (g,p) into (J,'d), is still 
a canonical transformation of the type generated by the S which occurs in 
(1.C.98) since the one generated by S was a point transformation, and then 
J is proportional to a without depending on /?; and this means that the 
Hamiltonian depends on J only. Therefore, we shall continue, for the sake 
of simplicity, also to use the symbol S for the generating function of the 
transformation from (g,p) into (J, i9). Then 



oq 



dS 

dJ' 



(1.C.119) 



Through this transformation, p, as well as g, will be a periodic function of 
d with period 27 t; the Hamiltonian % will depend on J and the equations of 
motion will be 



J = const.. 



0 dn 

V = —— — const. = uj 
dJ 



^d = ujt + 6, (1.C.120) 



where uj represents the angular frequency of the motion, while the period (in 
time) is given by 27 t/u;. From (1.C.119), it follows that 
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^ “ dq\dj) ~ dJ\dq) ~ 



and then 



d 

dJ 



j) pdq = ^ ^ dq = ^ dd = 



= 27T, 



where / means integration over a whole period (in the case of libration a 
motion there and back for q\ in the case of rotation an increase of 27 t). From 



d 

dJ 




= 27T, 



we then have (considering the integration constant to be equal to zero): 







(1.C.121) 



Comparing (1.C.121) with the first of equations (1.C.119), one has that 2ttJ 
equals the variation undergone by S' in a whole period. Referring to Figs. 1.18 
and 1.19, 2nJ will be the measure of the area bounded by the phase curve in 
the first case and of the area bounded by the curve, the q axis and the two or- 
dinates at a distance qo = 2tt in the second case. Therefore 2ttJ is exactly the 
action integral over a period: J has the dimensions of angular momentum and 
d of an angle; they are examples of quantities called action-angle variables. 
In the plane J^d the phase curves become straight lines J = const, parallel 
to the d axis, and then Figs. 1.18 and 1.19 have their equivalent in Fig. 1.20. 
Since, in our case (only one degree of freedom), the generating function S* 
is given by = S — dJ, its variation in one period (in which d varies by 
27 t and S by 2'kJ) will be ZiS'* = AS - 2'kJ = 0. Therefore S'* is a periodic 
function of d with period 27 t. This follows from the determination (1.C.121) 
of J. If we had considered S* as our starting point, we should of course have 
imposed the condition of periodicity to attain the same determination of J. If 
we make use of S* = S*(g, d) as the generating function of the transformation 
{q,v) ('^5 we have 



dS^{q,d) , dS\q,d) 



(1.C.122) 




Fig. 1.18 



Fig. 1.19 



Fig. 1.20 
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To sum up, we can go from the variables [q^p] to J) by means of a 
canonical transformation generated either by S{q^J) or by i?). Every 

time the new momentum J is a constant of motion and the new coordinate 
is a linear function of the time, we shall say that J is an action variable 
and d an angle variable. 



The Linear Oscillator 

As an example of the application of the method of action-angle variables, 
consider first the linear oscillator. From the Hamiltonian 

tP' 1 

T-L = h - — const. = h, (1.C.123) 

2m 2 

we have 

P = 2m[h-^ j = ^ ^ pdq ^ ^ j ^ ^2m{h - dq, 

where ±.q are the roots of muj‘^q^ = 2/i, that is, the coordinates of the points 
where the motion is inverted. Thus J — h/uj, and then the Hamiltonian in 
action-angle variables will be 






with the frequency lo given by 



uo — 



dn 

dJ' 



(1.C.124) 



(1.C.125) 



Equating the two expressions for the Hamiltonian (1.C.123) and (1.C.124), 
we obtain 

p = \/2mujJ — {mijoqY 
and, since d'djdq^ dpjdJ^ also 






arcsm 



/ pmuJ\ 



from which 



= a/—— sin??, p = p2mujJ cos??. (1.C.126) 

V muo 



The relations between (^,p) and (??, J) have been obtained, in this case, 
without resorting to S or which are given by 
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5 = / pdq = / dqy/2mLjJ — (mouq)'^ 

Jo Jo 

= J arcsinf ^4/ ) + \q\/2muoJ — m?(ju‘^q‘^ 

\ y ^J J 2 c.i: 

S'^ = S{q, J) — dJ = S — J arcsin ~ ^q\/2mujJ — m?u‘^q^ 

1 jmP‘Uj‘^q^ 222 ^ 2.0 

2 V sin d 2 



It is easy to check, using the first of equations (1.C.127) and of equations 
(1.C.126), that the variation AS corresponding to Ad = 27t is 2ttJ. Equa- 
tions (1.C.126), since J is constant, represent the parametric equations of 
the phase curve of Fig. 1.18. As we have seen in Sect. 1.3, by means of a scale 
transformation, we obtain as the phase curve a circle which is just a one- 
dimensional torus; in this case, using the variables q and pjmuj^ the radius 
of the circle will be ^J2J /muo. 



The Pendulum 

As a further example of an application of the method of action-angle variables 
to systems with one degree of freedom, let us consider the pendulum. Not only 
is it important per se, as a mechanical system, but also the structure of its 
Hamiltonian is the same as that present in most non-linear resonant problems 
of celestial mechanics. The pendulum can be thought of as consisting of a 
mass point P (of mass m) suspended in O (Fig. 1.22) by means of a weightless 
string of length 1. The angle q is measured from the downward vertical line 
through O; the point is subjected only to gravity, whose acceleration is g. If 
we set, conventionally, the zero of the potential energy to that corresponding 
to q = 7t/ 2, the Hamiltonian will be 

p2 

H = — — mgl cos q = h = const., (l.C.128a) 

2G^ 

where G = mP is the moment of inertia and P = Gu — Gq the angular mo- 
mentum. In (l.C.128a), the potential energy V{q) — —mgl cos q is a periodic 
function of period 2tt. By choosing units such that G = mP = 1 and putting 
h — g/l^ the Hamiltonian (l.C.128a) can be rewritten as 

p2 

H = — bcosq = h. (l.C.128b) 

The quantity b is positive and, for —b<h<b^ one has libration with q 
varying between the values 

q' = arccos {—h/b) and q" = — arccos {—h/b ) ; (1.C.129) 
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in Fig. 1.21 the curve is an ellipse. For h > one has rotation; the upper 
curve corresponds to positive P and the lower curve to negative P. 

At this point, it is natural for us to consider as phase space the cylindrical 
surface obtained by identifying the lines q = — tt and q = H-tt. In the limiting 
case h = b, the phase curve is given by the separatrix (the dashed line in the 
figure); in this case the pendulum attains asymptotically the position q = ir 
in an infinite time. In fact, from (l.C.128b), in the units we have chosen. 



b — -q^ — bcosq^ 



and then 



dt = 



dq 

y^2b{l + cosq) ’ 



integrating this, one will obtain t ^ oo, for g ^ tt. Let us see, now, in what 
way one passes to the action-angle variables in the cases of libration and 
rotation. The Hamilton- Jacobi equation will be 



= h{j), (1.C.130) 

with 

dS dS 1 f 

From (l.C.128b) 

J = — / \/2{h + b cos q)dq. (1.C.131) 



As is known, (1.C.131) is not evaluable by means of elementary functions: 
through suitable substitutions we arrive at an elliptic integral. In the libration 
case, we put 




hPb 

2b 



sinx. 



2b 
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From the definition, x varies from 0 to 2tt during a complete libration, and 
also 



We obtain 



I h b 

h + b . 2 

cos q — I r — sm 



and from (l.C.128a) 

p = cos q) = y/2{h -j- b) cos x = 2Vba cosx. 

As 



dq = 



2a cos xdx 

\/l — sin^ X 



we obtain 



cos^ xdx 



v2 Gin^ ' 



y/l — sm“ x 



dx 



a^ sin^ xJ 



(1.C.132) 



S = [ pdq = 4\/6 f 
Jo Jo 

= 4\/6 f \/l — sin^ xdx + — 1) / y=^ 

.do do V 1 - 

= 4v^[FJ(x, q;^) + — l)F"(x, a^)] , 

where the following elliptic integrals have been introduced:^^ 

px pX 

E{x,a^)— / a/i — sin^ u du, F(x,a^)= / 

do do yl — a^sin^ii 

In the same way, we can calculate J, this time by means of the complete 
elliptic integrals 



du 



p27T 



J 



= — / 2\/&q;cosx 
27T do 

>7t/2 



2a 



yl — o^sin^ r 



: cos xdx 



4 / — 4\/& f Vl — a^ sin^ xdx + (a^ — 1) / 
I 27 t [do ^ 



dx 



\/l — a^ sm" X 



^ Y J. 

= ~Vb[E{a^) + {a^ - l)K{a^)] , (1.C.133) 

where 

E(a2) - E{x = 7t/ 2, a^), ii:(a2) = = tt/ 2, a^). 

It remains now to evaluate 

dJ dadJ da\da) 



^^See, for instance, E. T. Whittaker, G. N. Watson: A Course of Modern Analysis 
(Cambridge University Press, 1927), pp. 512 ff. 
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Applying the known properties of elliptic integrals, 

d[aF{x^a^)] E{x,a^) sin x cos x 

da 1 — 1 — 0^2 \/l — sin^ x 

dE{x^a^) E'(x, — F(x, Q!^) 

da a 

d[aK{a^)\ _ E{a^) dE{a^) _ E{a‘^) - Kja^^) 

da 1 — ’ da a 

and considering S = S{q, a) as 

4:Vb E ^arcsin^— sin , a^^ + {a‘^ — 1) F ^arcsin^— sin , a^^ , 



one obtains 



oc 

— = AVbaF{x, a^). 
da 



Analogously 



^ = —Vb^ \E{a‘^) + (a‘^ — l)K{a‘^)] — —VbaK{a‘^). 
da 7T da ■' tt 



And finally 



,_dSfdjy^_ 7T _ 2 ^ 

da\da) 



(1.C.134) 



By substituting the definition of x in (l.C. 132-134), we obtain the wanted 
expressions for S,J,d. Regarding the Hamiltonian, it is understood that it is 
defined implicitly by (l.C. 133) through 



n{j) + b 



(l.C. 135) 



In the case of rotation, h > b, so that o; > 1. We should make substitutions 
different from the previous ones, so as always to have real square roots. From 
(l.C.128b) 

hFb = + 6(1 — cos^), 

V = \/2[(/i + 6) -6(1 -cosg)] = yj2{hFb)\ I - \ sin^ 

V 2 

with 1/a^ < 1. Then, 



pdq = - [ \/l - ^ sin^ X dx. 



where we have put x = qj2. Therefore 
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(Z^ + 6) / 1 

9 ^2 



(l.C.136a) 



Through an analogous procedure and exploiting the recalled properties of the 
elliptic integrals, we also get 



d = F 

K(l/a^) 



1 i_ 



(l.C.136b) 



It is interesting enough to dwell upon the evaluation of the frequency. We 
have already said that the Hamiltonian will result only through an implicit 
definition; however, this does not forbid us from obtaining the frequency 
explicitly. In fact, differentiating (1.C.135) we shall get 



uj ■ 



dU 

dJ 



= ^ah 




7T \/h 

2 K{a^) 



in the case of libration. Analogously, for the rotation case 



7T lh-\-b 

^ a: (1/^2) V 2 



If now, in (l.C.128b), we expand cosq around the equilibrium position 
g = 0 and take into account only the first non-constant term (this is equivalent 
to considering only small oscillations and then a linearized pendulum), the 
equation of motion will he q bq = 0. Therefore = cc;o is the frequency 
of the linearized pendulum: if we call ujl and ljr the frequencies of libration 
and rotation respectively, we obtain 

7t 1 ujR _ na 

^ ^ 2 K(a2)’ ^ ^ K{l/a‘^)' 



For h b, both libration and rotation tend to the separatrix, and then, by 
using the asymptotic expansion for K, we have the two frequencies tending 
logarithmically to zero when a — > 1: 



lim — 

2 In 



7T 




7T 


4 


ot^i uJo in 


4 


\/l — 


y/a^ — 1 



1.17 Separable Multiperiodic Systems — 

Uniqueness of the Action— Angle Variables 

Before extending the results obtained to systems with more than one degree 
of freedom, we introduce (following Born)^® some concepts and definitions 
regarding multiperiodic functions. 



®®M.Born: The Mechanics of the Atom (G.Bell and Sons, London, 1927). 
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Definition. A function F{xi, . . . ^Xn\ is periodic in the variables 

Xi,X 2 ,... ,Xn with a period r with components if it identi- 

cally holds that 

F{xi +Ti, X 2 +T 2 , ... ,Xn+Tn] 2/l,...) =F(xi,X 2 ,... 2 / 1 ,-..). 

If xi,X 2 , . . . are considered in an n-dimensional space, a period r is a 
vector in this space: r = (n, T 2 , . . . , r^). 

Proposition 1. If r is a period for the function F, mr {m positive or 
negative integer) is also a period for F. 

Proposition 2. If and are periods for F, is also a period 

for F. 

Proposition 3. If a function F has several periods . . . , then 

any linear combination of them Ylk=i with coefficients given by posi- 

tive or negative integers, is also a period. Proposition 3 obviously comes from 
1 and 2. 

Now we impose the following condition: The function F does not have 
arbitrary small periods (that is, given e > 0, no period r exists with |r| < e). 
The consequence of this is that every period r is an integer multiple of 
Ti (r = mri), where Ti is the smallest among the periods of the type At. In 
fact, from the imposed condition, we have that if r and Ar are two parallel 
periods of the function F, then they must be commensurable, i.e. A must be 
rational. If this were not the case (since, owing to a well-known theorem of 
number theory due to Kronecker, if A is not rational one can always find two 
integers m and m' such as to make (m -h m'A) arbitrarily small) one should 
have (by using 3) an arbitrarily small period (m -h m'A) t. If we now call p/g 
the number A (with p, q prime numbers), we have that r/g is a period for F; 
in fact, in this case, again resorting to a theorem of number theory, we know 
that two integers m and m' must exist for which mq + m'p = 1, and then 
m + m'A = mFm'p/q = l/q.By hypothesis, we must then get a period r/qi, 
which is the smallest among all the periods of the form r/q. Therefore we 
can express every period whose vector has a certain direction as an integer 
multiple of a given minimum period. One can show that all possible periods 
constitute an /-dimensional lattice: that is, / periods . . . , exist 

such that for any other period one has r = with integer c^, 

uniquely determined. 

We shall consider the case where the number / of periods coincides with 
the number of variables n. Therefore any one of the periods of the function F 
will be given by a linear combination of the type r = . We shall 

then say that the function F (xi, ^ 2 , . . . , 2/i, . . .), periodic in the n vari- 
ables Xi, X 2 , . . . , Xn, has a fundamental system of periods . . . , 

If, now, we perform a transformation in order to obtain a new fundamental 




108 1. Dynamics and Dynamical Systems - C. Hamiltonian Dynamics 



system of periods, this transformation, besides having integer coefficients, 
must also be unimodular, that is, with determinant equal to ±1. In fact, if 
and are two fundamental systems of periods (A: = 1, 2, . . . , n), it 

must be the case that where n = {riij) 

and m = {rriij) are two n x n matrices with integer elements, and then also 

^{k) ^ ^ 'y^ riikmjiT^^\ 
i j 

Consequently mn = 1 , and then detm detn = 1. As detn and detm are 
integer, the only solution is detm detn = ±1. 

We summarize the above by means of the following propositions: 

Proposition 4. For any function F(xi,X 2 ,... ^Xn] yi,...) periodic in the 
Xi,X 2 , . . . ^Xn variables a fundamental system of periods does exist. 

Proposition 5. All the fundamental systems of periods of a function are 
connected through linear transformations with integer coefficients and deter- 
minant equal to ±1. 

If now, in place of the variables Xi,X 2 , . . . we introduce in the n- 

dimensional space a new system of coordinates , • • • with axes par- 

allel to the vectors of a fundamental system of periods, then the function F 
expressed as a function of the variables • • • , has the fundamental 

system of periods 

= ( 1 , 0 , . . . , 0 ), 

r(2) = (0,l,... ,0), 

= ( 0 , 0 ,... , 1 ). 

By means of a suitable scale transformation one can obtain 

= (27T,0, ... ,0), 

= (0,27T, ... ,0), 

= (0,0,... ,27t). 

In this case, one says that the i^ 2 , • • • • • •) has fundamental period 

2tt. 

In conclusion we can state the further proposition: 

Proposition 6. By means of a linear transformation of the variables in which 
a function is periodic, one can transform the function itself into another one 
having fundamental period 27 t. 
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Finally one can demonstrate^^ the following: 

Proposition 7. All the sets of variables in which a function has fundamental 
period 27 t are connected through transformations of the type 

k 

• ■ ,Z^n;yi,- ••)> 

k 

I'n = '^Cnk^k + ‘4>n{T^\,V2, ■ ■ ■ , ?J„; J/i, . . .), 
k 

where the Cik are integer coefficients, positive or negative, whose determinant 
has modulus 1, and the il^i are periodic in the Vi with period 2n. The function 
F, being a periodic function in the variables z^i, ^^ 2 , • • • can be expressed 
as an n variables Fourier series: 

+00 4-00 

,l^n)= ■■■ E + - + 

Ai= — OO An = — oo 

where, if F(z/i,... , is real, cai,...,a^ ca_i,...,a_^ are complex conjugate 
quantities. In vector language, the above relation can be rewritten as 

fh = ^ca4<^-), 

A 

with the coefficients given by 



where the vector A is a vector with n components which span all the positive 
and negative integer values from — oo to +oc and 0 < < 27 t, V/c. 

Now the task is to exploit what was said above to extend the method of 
action-angle variables to multiperiodic separable systems, i.e. to systems for 
which the Hamilton- Jacobi equation is separable and in each partial plane 
{QiiPi) one has either a closed curve (libration) or a periodic function qi 
(rotation). Obviously, the simplest case occurs when the Hamiltonian of the 
system (with n degrees of freedom) consists of a sum of n terms, each of them 
containing only a pair of conjugate variables Pi\ 

"F = + 'H2{q2,P2) + • • • + 'Hn{qn,Pn)- 

®®For the proof, see M. Born: op. cit., pp. 74-75. 



(1.C.137) 
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The Hamilton- Jacobi equation (1.C.98), Sect. 1.15, is solved by separating 
the variables, by putting 



Hi yqi, = hi, with hi h 2 + . . . hn ^ h; 

therefore the motion corresponds to that of a system resulting from the su- 
perposition of n independent systems with one degree of freedom each. One 
can generalize the method of the preceeding section, by defining the actions 

Ji = ^ Pidqi (1.C.138) 

and expressing the functions Si by means of Qi and J^. Then 

The classic example in this case is that given by the superposition of n linear 
oscillators; considering unit mass and qi = ^2 = ^ 2 , • • • , Pi = 

± 1 , . . . ^p^—Xn (i.e. using the Cartesian components as canonical variables) 
we can refer to what we have written in Sect. 1.3. For n = 2, 



H — ~{pI pI) - {^jql 

and from (1.C.126), Sect. 1.16, 




Pi = \/2a;i Ji cos'll, P2 — \/2uj2J2 cos P 2 , 



with 



— (jOit “h 61 , 



The Hamiltonian will become 



P2 ^2^ d- 62- 



H — H“ <^2^2- 



(1.C.140) 



(1.C.141) 



(1.C.142) 

(1.C.143) 



As was shown in Sect. 1.3, (compare (1.C.142) with (l.A.41a)), the phase 
curves wind round tori and are closed if ui and U 2 are rationally dependent 
and everywhere dense in the opposite case. For n > 2, the conclusions are 
obviously the same; only the possibility of a graphic representation fails. Re- 
turning to the general problem, it may happen that the Hamiltonian does not 
result a sum of n independent terms as in (1.C.137), but that the Hamilton- 
Jacobi equation (1.C.98) can be solved by separation of variables as well, that 
is, by putting 

S = S'l(^l) + S2{q2) + . . . + Sn{qn)’ 

In this case pk = dSk/dqk depends only on qk. 



(1.C.144) 
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For any pair qk^Pk one has the situation^^ considered in the preceding 
section; that is, one has either libration or rotation. Every action integral of 
the form (1.C.138) is constant: therefore we shall assume that the actions Ji 
are constant momenta. The Hamiltonian will be a function of only the 
and S will be expressed as a function of the Qi and Ji] the angle variables 
conjugate of the Ji^ will be given by 



^ dJi ^dJi' 



(1.C.145) 



Let us now show that the variables Ji , di introduced in this way behave 
as d and J did in the case of only one degree of freedom, that is, the qi are 
multiperiodic functions of the with the fundamental system of periods 



(27T, 0, . . . , 0), 

(0, 27t, ... ,0), 



(1.C.146) 



(0, 0, ... , 27t). 

Let us evaluate the variation of 'dk during a period (of libration or rotation) 
of variation of qk, keeping the other coordinates fixed. If we call Ak the 
increment one has in making only qk vary then 

= j> {d'di/ dqk)dqk. 

Partially differentiating (1.C.145), we get 

ddi Si _ d y. dSi _ d dSk 

dqk ~ ^ dJidqk ~ dJi ^ dqk ~ dJi dqk ’ 

and integrating, 

^ S*** ‘ijJ S'*® “ 

with 5ik = 0 for z A: and = 1 for z = A:. 

Vice versa, if we consider the • • • , '^n) and increase 'di by 27 t, 

keeping the other i? fixed, one has that qi performs a complete libration (or 
rotation); the other q may depend on '&i as well, but go back to the initial 
value without having run a whole cycle (otherwise the corresponding z? should 
vary by 27 t). In this way it turns out that the q are multiperiodic functions of 
the d with the fundamental system of periods (1.C.146). Every qk can then 
be expressed by means of the Fourier series 

^^This refers to the projections onto each phase plane (qi,Pi) of the trajectory in 
the 2n-dimensional phase space. 
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(1.C.148) 

A 



Since 'd are functions of time, they can be obtained from the canonical equa- 
tions, and, as in (1.C.142), 



i9k + 



with 



dn 

^ 

OJk 



(fc = 1,2, ... ,n). 



(1.C.149) 



Prom (1.C.148) and (1.C.149) one then has 

(1.C.150) 

A 



with 

X ’ UJ = AiC^i -h X2UJ2 + . • . + Xn^ni 
X ' S — Ai( 5 i -h X2S2 + . . . + Xn^n- 

Here also, for every J, the phase curves will wind round the corresponding 
n-dimensional torus: if the uoi are rationally dependent, the curves will be 

closed; in the opposite case the curves will be everywhere dense on each 

torus. In the first case ujk — nkuo (where the are integer), V/c, and then 
one has a unique frequency, and the motion is periodic in the real sense of 
the word (the system is also said to be completely degenerate). In the second 
case the motion is said to be conditionally periodic. As to the function S'*" , in 
this case given by 

S* = S~Y,^kJk, (1.C.151) 

k 

since S increases by 27 t every time the coordinate qk runs a complete cycle 
(the other Qi remaining fixed), it turns out that it is a multiperiodic function 
with fundamental period 2tt. In fact, one has from (1.C.151) that, while S 
increases by 27 rJfc, dk increases by 2tt and the other di remain unchanged: 
then = 0 on every cycle. As usual, one can consider 5* as generating a 
canonical transformation from (qi,Pi) into (di,Ji): S'* = S'^{qi,di). Then 



Jk — 



dS'^{qi,dj) 

ddk 



dS'^{qi,dj) 

dqk 



(1.C.152) 



Now one can ask oneself if the integrals (1.C.138) which define the action- 
angle variables are fully determined by the set of variables {qk,Pk) in which 
the Hamilton- Jacobi equation is separable or not. It can be shown that the 
answer is different according to whether one has rationally independent fre- 
quencies (nondegenerate system) or rationally dependent ones (degenerate 
system). In the former case one sees that the Jk are uniquely determined by 
the set (qk.Pk) which makes the Hamilton- Jacobi equation separable; in the 
latter (degenerate system) this is no longer true: different sets of variables 
can exist, giving rise to separability, connected by transformations which are 
not only point transformations but more generally canonical transformations. 
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Stating the problem in more general terms, one can reach some conclusions 
which we shall explain without proof.^^ 

Let a Hamiltonian system be given whose function H = H {qi,q 2 ^ • • - ^qn‘-> 
Pi, P 27 • • • ^Pn) is independent of time and such that one can introduce new 
variables Jk^ by means of the canonical transformation generated by 



dS , dS 
oqk oJk 

Moreover, let us assume that the following conditions hold true: 

a) The configuration of the system is periodic with regard to the i^ki with a 
fundamental period 2n. 

b) The Hamiltonian l-Liqi^Pi) is transformed into a function h{Jk) depending 
only on the Jk- 

c) The function S'*" defined by (1.C.151) and generating the transformation 
given by (1.C.152) is periodic in the Pk with a period 2tt. 

Now one has that, if (a), (b), and (c) hold true and if among the 
ujk — dhjdJk no relation of rational dependence exists, then the Jk are 
uniquely determined up to a linear transformation with integer coefficients 
having unit determinant. In the case in which, besides the validity of (a), (b), 
and (c), relations of rational dependence also exist among the uok (degener- 
ate systems), it is always possible (by means of a transformation) to make 
a certain number / of the n frequencies be not commensurable (rationally 
independent), whereas the other n — f vanish. Having reached this condition, 
again the transformed Jk are uniquely determined, obviously up to the linear 
transformation with integer coefficients of the former case. 



1.18 Integrals in Involution — 

Liouville’s Theorem for Integrable Systems 



Given a set of n functions /i , /2 , • • • , /n depending on the qi , the pi and on 
the time t, they are said to be in involution, or to constitute an involution 
system, if all their Poisson brackets vanish identically, that is if 

(/„/,)= 0 , Vi,j = l, 2,...,n. 

Now we will demonstrate that, when the Jacobian 

d(Pl,P2,---,Pn)^ 

‘^'^See M. Born: op. cit., Sect. 15. 



(1.C.153) 
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if we consider the functions /i, / 2 , • • • , /n as n new coordinates Qi, . . . , Qn, 
(1.C.153) in addition guarantees the existence of n functions Pi, P 25 — ^Pn 
such as to constitute together with the Qi a new set of canonical variables. 
Therefore let us set 



fi{Q,P,t) =Qi, i=l,2,...,n. 



(1.C.154) 



In a neighbourhood of {p,q), since J 7 ^ 0, we can solve (1.C.154), thus 
obtaining the Pi : 

Pi = <Pi{q,Q,t). (1.C.155) 

If we now replace the Pi by the pi in (1.C.154), we again obtain n identities 
in the g, Q, t: 

fi(q,ip,t)-Qi=0, i = l,2,...,n. (1.C.156) 

By deriving (1.C.156) with respect to the qi, one has 



dfi dfi dipi ^ 

dqj dpi dqj 



(1.C.157) 



Let us now multiply (1.C.157) by dfk/dpj and sum over j from 1 to n: 
dfi dfk , dfi dpi dfk _ „ ^ 



dqj dpj dpi dqj dpj 



(1.C.158) 



If now in (1.C.158) we interchange i with k and subtract the equations ob- 
tained from equations (1.C.158) themselves, we obtain, taking into account 
(1.C.153), 

dfi dfk dpi dfk dfi dpi _ ^ 
dpi dpj dqj dpi dpj dqj 

which, interchanging the dummy indices in the second term, can be rewritten 



(LC.159) 

dpidpj\dqj dqi J 

Equations (1.C.159) constitute a system of homogeneous equations in the 

_ dpj \ 

\dqj dqi) 

and the determinant of the coefficients is certainly different from zero, being 
given by the only solution is then 

^ = Vj,l = l,2,...,n. (1.C.160) 

dqj uqi 



V = 1 , 2 , ...,n. 



(1.C.160) 



Consequently, a function must exist (we call it S in view of a forthcoming 
application) such that 



Pi — Piiq^ Qi 



dS{q,Q,t) 
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If, in addition to (1.C.161), we also take 



dS{q,Q,t) 

dQi 



(1.C.162) 



it is easy to check that (1.C.161) and (1.C.162) define a canonical transfor- 
mation (q^p) {Q^P) generated by = S{q,Q,t). When the n functions 

/i, /2, • • • 5 /n are first integrals for our system, we can also, according to what 
we have just demonstrated, consider them as new coordinates, constants of 
the motion: 

fi{Q,P,t) = ai, i = l,2,...,n. (1.C.163) 

Because of what we have seen in the preceding sections, the system is com- 
pletely integrable. Therefore, when we have a system of n (independent) first 
integrals in involution, the Hamiltonian system is completely integrable. But, 
as a matter of fact, we can obtain an even more meaningful result: if the n 
first integrals (1.C.163) are in involution, then the corresponding momenta 
(1.C.162) are also constant and we have all the possible 2n independent first 
integrals and therefore the complete solution of the problem. 

This result is due to Liouville.^^ To demonstrate it, let us evaluate the 
derivatives d(fi/dt along the solutions of the equations of motion 



d(pi _ ^ _ d'H{q,p,t) _ dipj dqj 

dt dt dqi dt dqj dt dt dqi dt 

where we have used (1.C.160). Therefore, in addition, 

d'H{q,p,t) _ d(pi d(pj dTi 

dqi dt dqi dpj ’ 



that is. 



dipi _ _md^^ 

dt dqi dpj dqi dqi' 



(1.C.164) 



where we have written as H = ^{q^OL^t) the Hamiltonian expressed as a 
function of the qi, ai and t, through the pi. The existence of (1.C.160) and 
(1.C.164) means the existence of a function S{q, a, t) whose exact differential 
is given by 



Pidqi + P2dq2 + • • • + Pndqn ~ Pdt — dS. (1.C.165) 



If, finally, we consider the canonical transformation generated by S according 
to (1.C.161) and (1.C.162), the new Hamiltonian will be )C = P + dS/dt\ 
but, on the other hand, (1.C.165) gives 






(1.C.166) 



Liouville: Note sur les equations de la Dynamique, J. Math, et AppL XX, 137- 
138 (1855). 
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and therefore /C = 0 and the new momenta Pi are constants of the motion. 
In this case, (1.C.162) will be written 



i=l,2, (1.C.167) 

UOLi 

and the constants Pi are the wanted integrals, also in involution being canon- 
ical momenta. In turn, S will be a complete integral of the Hamilton- Jacobi 
equation: 

When the integrals (1.C.163) are independent of t, the demonstration be- 
comes even simpler and in place of (1.C.168) one will have the stationary 
Hamilton- Jacobi equation. The existence of n first integrals in involution for 
a system with n degrees of freedom also has consequences of a geometrical 
nature (with those geometrical features we have already met but from a dif- 
ferent point of view). In fact, the existence of n integrals fi{q,p) implies that 
every trajectory of the system is confined on an n-dimensional manifold M 
defined by the fi{q,p) = ai = const., i = 1, 2, . . . , n. Since we are considering 
bounded motions, that is, motions occurring in a finite part of phase space, 
M must be a compact manifold. If, now, we define the n vector fields (with 
2n components) in phase space. 



Vi 



% 

dp ’ dq 



(1.C.169) 



on every M, defined by fixing an n-tuple of constants ai for the fi{q,p), the 
Vi are regular and linearly independent functions, such as the fi. 

Moreover, the n linearly independent vectors (1.C.169) are tangential to 
the manifold M. On the other hand, we know from topology (Poincare-Hopf 
theorem) that an n-dimensional manifold for which it is possible to build up 
n linearly independent vectors, tangential to the manifold itself and nowhere 
vanishing, has the topology of an n-dimensional torus. 

Therefore, to every n-tuple of constants for the /^, there corresponds 
an n-dimensional torus T^: these tori are called invariant tori^ because any 
orbit originating on one of them remains there indefinitely. The existence 
of these tori in phase space enables us to define the action-angle variables 
in a, so to say, intrinsic way. That is, now the angles di will be the angles 
on the torus and the actions li will be the n constants of motion which 
define the torus itself. For a completely integrable system, the transformation 
into action-angle variables is complete, i.e. we can imagine the phase space 
completely filled with tori; each trajectory will lie entirely on one torus or 
another according to the initial conditions and, as we have said, will remain 
there for ever. 
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Lie’s Theorem 

Let us suppose now that, for a Hamiltonian system with n degrees of freedom, 
we know / independent first integrals in involution with f < n. One can 
demonstrate that in this case Lie’s theorem^^ holds: If the independent first 
integrals in involution are also solvable for the pk {k — 1,2, ...,/), as in 
(1.C.155), then the order of the system can he reduced to 2{n — f). 



1.19 Lax’s Method — The Painleve Property 

Until now we have developed the discussion based upon the integrability of 
Hamiltonian systems in such a way that one might feel entitled to consider 
the terms integrability and separability synonymous. In fact, in all cases, the 
problem has been brought back to the search for a suitable set of variables 
which enabled us to separate the various degrees of freedom and then reduce 
the system to quadratures. The search itself for the first integrals picked out, 
in successful cases, the set of variables in which the system was integrable. 
However, there exist cases which do not come into the picture sketched above. 
The most famous example is the so-called Toda lattice^"^ The physical system 
consists of n particles, moving on a straight line, acted upon by exponentially 
decreasing forces. As xi, X 2 , . • . , are the coordinates of the n mass points 
on the straight line, the equations of motion are 

(j = l,2,...,n), (1.C.170) 



l,2,...,n) (1.C.171) 

k=l 

and Xn+i = Xi (periodicity condition). To integrate the problem, one can 
now apply a method, first described by Lax^^ in a quite different context 
(the search for solitonic solutions of the Korteweg - de Vries equation; then 
continuous systems and infinite degrees of freedom). Immediately afterwards 
it was shown that the method could also be applied to discrete systems and 
in particular to the Toda lattice. Let us see now, briefly, how this method 
works (in the case of the Toda lattice). The basic statement is that, if the 
equations of motion can be transformed in a matrix equation of the form 

^^See S. Lie: Begriindung einer Invarianten theorie der Beriihrungs transformatio- 
nen. Math. AnnalenVlll, 215-303 (1875). 

Toda: Waves in nonlinear lattice, Prog. Theor. Phys. Suppl. 45, 174-200 
(1970). 

D. Lax: Integrals of nonlinear equations of evolution and solitary waves, Comm, 
on Pure and Appl. Math. XXI, 467-490 (1968). 



dU 
- dx, 

with 

n 

^ = -E 
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L = BL- LB, 



(1.C.172) 



where the square matrices B and L have elements that are (in general com- 
plex) functions of Xi,xi, then the eigenvalues of the matrix L are first integrals 
and therefore do not vary with time. A matrix which, while varying as a func- 
tion of a parameter (in this case the time t) keeps its eigenvalue spectrum 
unchanged, is said to undergo an isospectral deformation. 

Equation (1.C.172) is the equation which regulates this deformation and 
coincides with the equations of motion of the system. Let us now try to 
demonstrate the assertion; that is, if we find two matrices L and B for which 
the equations of motion can be written in the form (1.C.172), then the eigen- 
values of L are independent of time. For this it is sufficient to demonstrate 
that the various forms of L{t) as t varies remain similar. Hence, one must 
find a nonsingular and diflFerentiable matrix U {t) for which 



-[U{t)-^L{t)U{t)]=0. 

As U{t) is nonsingular, U{t) = 1. 

Taking the derivative with respect to f, we get 



(1.C.173) 



from which 



dU 



-1 



dt 



= -U 



-1 



dt 



u-\ 



(1.C.174) 

(1.C.175) 



Consequently, (1.C.173) will become 

d 



^ (U-^LU) = -U-^^U-^LU + U~^^U + U-^L^ 
dt^ ’ dt dt dt 

dt dt dt 



= u-^ 



u 



+ BL-LB + L^U-^ 
dt dt 



because of (1.C.172). Let us now put 

17“^ = B{t). 



dU 



dt 



U, 

(1.C.176) 

(1.C.177) 



If the matrix B{t) is known, U{t) can be obtained as a solution of the equa- 
tion 

— = B{t)U{t), (1.C.178) 



with the initial condition 



U{0) = 1 , 



(1.C.179) 
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by assuming that the solution exists in a real interval in which B is con- 
tinuous as a function of t. By substituting (1.C.178) into (1.C.176), one has 
^ (f/-^ LU) = 0, and then L{t) = U~^ LU = L{0) = const. L{t) and i(0) 
are similar matrices; therefore they have the same spectrum and the same 
characteristic polynomial. Hence, we have demonstrated that, if the equa- 
tions of motion can be written in the form (1.C.172), then the eigenvalues 
of L{t) are first integrals. In practice it is more convenient to make use of 
the coefficients of the characteristic polynomial det{L — Al), rather than the 
eigenvalues A roots of the characteristic equation. It remains then to make 
out which integrals among these, found in this way, are independent. For the 
Toda lattice, (1.C.170) and (1.C.171), for the matrices L{t) and B{t) has 



been given the representation' 


46 




/h 


di 


0 


0 ... 


Q"n ^ 


/ 




ai 


^2 


d2 


0 ... 


0 


1 




0 


d2 


bs 


as ... 


0 




L = 


0 


0 


as 


64 ... 


0 


B = 




\ dji 


0 


0 


0 ... 


bn) 


V 



0 

-ai 

0 

V CLfi 



ai 

0 

— U2 



0 

d2 

0 



0 

0 

^^3 



dr] 

0 

0 



0 0 0 ... 0 / 

(1.C.180) 

Here: 2dk = 2bk — —Xk and it is easy to check that dk and bk 

satisfy 

ak = cik{bk+i - bk), bk = 2{dl - (1.C.181) 

Moreover it has been shown that the eigenvalues of Z, besides being first 
integrals of (1.C.170), are independent and in involution. As an application, 
let us look at the case n — 3: 

gl/2(xi X 2 ) gl/2(x3 Xi) 

gl/2(xi-X2) 

gl/2(x3-xi) gl/2(x2-X3) 






gl/2(x2-X3) 

-X3 



(1.C.182) 



and the characteristic polynomial, 
det(L - Al) = - ^ + A^ 



+ - ( -*3 + A I + - + - 



1/1 



4V 2 



= -A^ - -{xi + ±2 + is) A^ 



4V 2 



:X2 



+ A 



+ 1 ^ g(x2-x3) ^ - i2is - isii] A 

1 



1 

+ 8 



^^gX2 X3_^^^gX3 



+ 



Henon: Integrals of the Toda lattice, Phys. Rev. B 9, 1921-1923 (1974); 
H. Flaschka: The Toda lattice. II. Existence of integrals, Phys. Rev. B 9, 1924- 
1925 (1974). 
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Let us put 

-P(A) ~ 2 H" ^2 + ^3) 

_i - xiX2 - ±2±^ - xzxi A 

4 [ 

— X1X2X3 

(l.C.183a) 

If we rewrite this as 

= A" + ^/lA^ + -hX + L3 - (l.C.183b) 

because of what was said above, /i,/2,/3 will be first integrals. 

Recalling that, in this case, the Hamiltonian can be written 

n = ^{xl+xl+ ±1) + , (1.C.184) 

we then have as first integrals 'H, /i, /2, ^3- It is easy to see that h represents 
the total momentum and the relation 

h = - n (1.C.185) 

holds. It is not possible to interpret Is in a similar way, since it is a third- 
degree polynomial in the momenta. Moreover, it is easy to foresee that, as 
the number 4n of particles increases, the degree of the polynomials in the mo- 
menta which constitute the first integrals will also increase. We have already 
encountered an analogous situation when dealing with the anisotropic planar 
oscillator in the case of an integer ratio between the two frequencies (for n — 3 
we had a third-degree polynomial in the momenta). Furthermore, after the 
introduction of Lax’s method for the integration of discrete systems, it was 
discovered that the method can be applied in almost all problems which had 
been integrated by means of other methods. A schematization of the problem 
according to (1.C.172), in some cases, besides providing the first integrals, is 
also useful in the explicit integration of the equations of motion. 

To summarize, the method by separation does not exhaust all the possi- 
bilities that a system has to be integrated: separability and integrability are 
not synonymous. Therefore it is convenient to make a distinction between an 
integrability “a la Liouville” (the one just obtained by separation) and the 
integrability obtained in another way. 

Before we move on to outline another method of studying the integrability 
of a system, let us go back to the Toda lattice to point out a very interesting 
feature. Let us consider again the case n = 3, but now put the three particles 
on a circle (see Fig. 1.23). Now the coordinates will be given by the three 
angles V^3 and the conjugate momenta by the corresponding momenta 

Pi^P 2 ,P 3 • The Hamiltonian will be 
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Fig. 1.23 




7^ = 1 (p2 + pI+pI) + - 3, (1.C.186) 

where we have fixed the zero of the potential energy according to a convention 
which will be advantageous later on. 

Now perform a canonical transformation from {(Pi^Pi) to {^i,Pi), gener- 
ated by a function W 2 given by 





W2i^i,Pi) 


= P\PI +P 2 P 2 


+ (T 3 — Pi — P2)p3 


From 


dW2 




(z = 1,2,3), 




Upi 


^P^ 


we get 


Pi = P\, 


P2 = -P 2 , 


P3 = P 3 — Pi — P 25 



— Pi — Psi ^2 — p2 — Ps^ ^3 = P3i 
and then Pi P2 P Ps = Ps- The new Hamiltonian will be 

= l [‘^pf + ^Pi + Pi - 2AP3 - 2P2P3 + 2P1P2] 



(1.C.187) 



(1.C.188) 



As ^3 is an ignorable variable, P 3 (corresponding to the total momentum in 
the old variables) will be a first integral. If we now put P 3 = 0 (and this is 
completely justified), (1.C.188) becomes 



JC' = Pf + P| -f P 1 P 2 + -h - 3. (1.C.189) 



From a formal point of view, we now have to handle a system with two 
degrees of freedom characterized by the canonical variables , ^2 5 ? P 2 and 

the Hamiltonian JC' . Perform a new canonical transformation generated by 
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W 2 = ^ 



4^3 



(Pi - VSp2)^i + (Pi + y/3p2)^2^ 



to pass from Pi) to (qi,Pi)- We will obtain 



Qi = 



dWo 



Pi = 



dWn 



dpi’ ' ' d^i 

Therefore 



(1.C.190) 



(1.C.191) 



4v^ 
1 






4^3 

52 = J (^ 2 -^ 1 ), K=^K'. 



If, instead of the transformation generated by (1.C.190), we use the same 
transformation but with the multiplicative constant c = 1/(8a/ 3) (see 
Sect. 1.12), we obtain 



1 



Qi = 



Q2 = 



8v^ 

1 



Pi =Pl, 
P2 






(1.C.192) 



8^/3’ 



where Qi^Pi are the new canonical variables and /C the Hamiltonian. Lastly, 
let us also transform the time (in extended phase space): 



ICdt — JC dr^ 

with ICjlC = dtjdr = l/\/3. The final result is 

^ ^ (P? + pi) + ^ exp (2q2 + 2 VS 51) 

+ exp ( 2 q 2 - 2 V 3 qi^ + exp (-4^2) 



(1.C.193) 



(1.C.194) 



The function 1C = ^{qi,q 2 ,Pi,P 2 ) in (1.C.194) represents the Hamiltonian 
of a particle of unit mass subjected to a potential whose level curves {U = 
const.) are represented in Fig. 1.24. The potential grows, going from the inside 
to the outside. The system with Hamiltonian /C is obviously integrable, and 
we have performed canonical transformaHons. 

If we now expand the Hamiltonian K in series of qi and ^2 and retain 
terms up to the third degree, we obtain 

^ + ^ + ^) + 9? 92 - ^ 



(1.C.195) 
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Equation (1.C.195) is the Hamiltonian of a famous non-integrable system 
(Henon-Heiles),^^ which will be dealt with in Volume 2. The surprising fact 
is that what turns out to be non-integrable is the truncated expansion instead 
of the series itself: that is, the opposite of what one would normally expect 
occurs. It is evident, at this point, that the term —3 in (1.C.188) has been 
introduced to make the truncated series expansion give exactly the Henon- 
Heiles Hamiltonian. 

Let us rewrite this Hamiltonian in the most general form: 

^\{pI + pI + + y^) + Dx^y - ^ Cy^. (1.C.196) 

The corresponding Newtonian equations of motion are 

X — —X — 2Dxy, y = -y - Dx‘^ -\- Cy‘^ , (1.C.197) 

They constitute a system, in normal form, of two non-linear second order 
equations. To such a system one can apply^^ a further criterion (usually 
called the Painleve property) to find for what values of the coefficients D and 
C (besides, obviously, D = (7 = 0) it is integrable. Since the subject is tech- 
nically complex, for the moment we confine ourselves only to an outline. First 
of all, one must no longer study the equations in the real field, but in the 
complex field: that is, one considers the time t (and thus also x and y in this 
case) as a complex quantity; then the various functions involved must be an- 
alytical functions, that is, representable by means of power series. The theory 
of differential equations in the complex domain^^ says that if we are dealing 
with linear equations in normal form, the solutions can have singularities 
only at the points at which the coefficients of the equations themselves are 
singular. Therefore, we have fixed singularities, that is, ones not depending on 
the initial conditions. If, on the contrary, we consider non-linear equations, 
then we may also have moving singularities, that is, depending on the initial 
conditions. For example, the simple equation y' + y^^ = 0 has the general 
solution y = {x — c)~^ . 

One says that y has a pole (of the first order) at the point x = c, which 
moves as the arbitrary constant c varies. In order, the singular points may 
be poles (we have already seen an example), algebraic branching points (e.g. 
a square root), trascendental branching points (e.g. a logarithm), or essential 
singularities, like exp(— 1/t), whose expansion in powers of t has negative 
powers at any order. The criterion mentioned above consists in this: if a 
system of non-linear differential equations has, as moving singularities, only 
simple poles (Painleve property), then it is integrable. 

Henon, C. Heiles: loc. cit. in Footnote 2 in the Preface. 

^^See Y. F. Chang, M. Tabor, J. Weiss: Analytic structure of the Henon-Heiles 
Hamiltonian in integrable and non integrable regimes, J. Math. Phys. 23 , 531-538 
(1982). 

^^See, for instance: E. L. Ince: Ordinary Differential Equations (Dover, 1944). 
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In the last few years, the criterion has also been extended to the presence 
of algebraic branching points (e.g. for the Henon-Heiles Hamiltonian); in this 
case, one speaks of the weak Painleve property. Of course, the Painleve prop- 
erty constitutes, in a certain sense, a sufficient condition for integrability and 
not a necessary condition. Up to now we do not even have a fully exhaus- 
tive demonstration of the criterion itself; anyhow, one point is well founded: 
the criterion can certainly be applied when we have Hamiltonians that are 
algebraically integrable, i.e. when the first integrals are polynomials in the 
canonical variables. 

Historically, the first case in which the integrability of a mechanical system 
was demonstrated by studying the equations in the complex domain goes back 
to Sophya Kovalevskaya, who showed the integrability of the problem of the 
motion under gravity of a body one of whose points is fixed (in the case where 
the three principal moments of inertia are in the relation A — B — 2C).^^ 

At this point of our exposition, we can draw a conclusion, even if tempo- 
rary: given a Hamiltonian system (the same occurs if we are in the Lagrangian 
or the Newtonian scheme), no systematic method exists for determining if it 
is integrable or not. The methods we have explained until now do not enable 
us to give a definite answer in every case. We shall see in volume 2, which 
deals with almost integrable systems, what aid can be obtained from numer- 
ical methods to guess the existence of first integrals. What can be henceforth 
anticipated is that the method which has been very successful in dealing 
with these problems is a mixed one, that is a method consisting of numeri- 
cal investigations whose result indicates where to concentrate the analytical 
study. 

We should not even forget what Poincare maintained, that is, that a 
system of differential equations is only more or less integrable. 



Kowalevski: Sur le probleme de la rotation d’un corps solide autour d’un 
point fixe, Acta Mathematica XII, 177-232 (1888); Sur une propriete du systeme 
d’equations differentielles qui definit la rotation d’un corps solide autour d’un 
point fixe, Acta Mathematica XIV, 81-93 (1890). 




Chapter 2 

The Two-Body Problem 



The two-body problem is the only case of the A/'-body problem that one can 
solve completely; therefore it has been investigated in all its details and there 
are innumerable treatments of it even at an elementary level with special re- 
gard to its application to the solar system. But the “solution” of this problem 
also has certain features which deserve to be discussed both on their own and 
as a basis from which to begin studying more complex problems. 

This chapter is devoted mainly to this end, reducing to a minimum the 
explanation of traditional subjects that the reader can find in any one of the 
available textbooks on celestial mechanics. 
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2. The Two-Body Problem 



2.1 The Two-Body Problem and Kepler’s Three Laws 



As we noted in the introduction, from Kepler’s three laws it is possible to 
derive Newton’s law of universal gravitation: 

Between any two bodies^ of masses mi and m 2 at a distance r from each 
other there exist attractive forces F 12 and F 21 directed from one body 
to the other. The two forces have equal magnitude which is directly pro- 
portional to the product of the masses and inversely proportional to the 
square of the distance. 



The law of universal gravitation can thus be written 



Fi2 — F 21 — G 



mi m 2 



where the universal constant G depends only on the chosen system of units. 

Let us now consider, in a given inertial reference system, two bodies of 
mass mi and m 2 respectively; ri and V 2 are their distance vectors from the 
origin of the system. Moreover, to preserve greater generality, rather than to 
explicitly assume the dependence on the inverse square, let us assume only 
a generic dependence on r, leaving undetermined the form of the function. 
The accelerations^ for the two bodies will be respectively 



ri = -Gm 2 /(r)^l^, r, = -Gm, (2.1) 

where r — \v 2 - ri\. System (2.1) is of order 12 and thus the solution, in 
general, will depend on 12 constants. If it were possible to obtain ri and V 2 
explicitly as functions of time, we could obtain at any instant of time the 
position of each body and then the orbit described by it in any given interval 
of time. But we can adopt another point of view. If we multiply the first of 
equations (2.1) by mi and the second by m 2 and then sum them, we obtain 

mi ri + m 2 r2 = 0, 



and, by defining 






miVi -h m2V2 
mi + m2 



the distance vector of the barycentre of the two bodies, r, 
grating, we immediately get 



^c.m. — F h. 



( 2 . 2 ) 

= 0. By inte- 



(2.3) 



i.e. the barycentre moves in a straight line with constant velocity (a and 
b are two arbitrary constant vectors, therefore corresponding to six scalar 

^ The bodies must be point masses or spheres and homogeneous in concentric layers. 
^ We consider the equivalence law for the gravitational and inertial mass of a body 
to be already proved. 
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constants). Finally, if we subtract the first of equations (2.1) from the second 
and define r — V2 — ri, we obtain 



r = -G {mi +m.2) f{r)-. (2.4) 

r 

This is the equation of the relative motion and is the same equation we 
should have for the motion of a particle attracted by a fixed centre with mass 
mi + 7712. Therefore each particle moves as if it were attracted by a fixed 
centre (at the position of the other particle) with mass M = mi + m 2 . The 
orbit of each particle as seen from the other one is the relative orbit. Since 
the equation of the relative motion is invariant under the exchange (r, — r), 
the relative orbits are geometrically identical.^ The original problem of the 
two bodies of the system (2.1), has thus been split into the following two 
problems: 

1) the motion of the barycentre, 

2) the motion of a body attracted by a fixed centre. 

If one is able to solve problem 2), one automatically obtains the solution of 
system (2.1). 




Another point of view is that of referring the motion of the two bodies 
to the barycentre, that is, of considering the barycentre O' as the origin of 
a new coordinate system (see Fig. 2.1). Since the motion of the barycentre is 
rectilinear and uniform, the system with its origin at O' is also an inertial 
system and therefore the equations of motion will be of the same type. In 
fact, if we call r'l = ri — ^2 “ '^2 — '^c.m. the new coordinates, then 

mir'i + m2v'2 = 0 (2.5) 

^ “Two bodies attracting each other mutually describe similar figures about their 
common centre of gravity, and about each other mutually.” (Newton: Principia^ 
Proposition LVII, Theorem XX, op. cit. p. 164.) 
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and also mir^ = 777 , 2^2 ^ '^2 



jr^l- Therefore: 



, , mi -f- m2 , mi + m2 / 

r = -h T 2 = Ti = r. 



m 2 



mi 



25 



and, by substituting in (2.1), we get 

r' = -Gm 2 /(— r'l) ^2 = -Gmif(—r'^ 

\m2 J \mi J 



( 2 . 6 ) 



Equations (2.6) are formally independent equations, but actually only one 
equation is required in force of equation (2.5): there are then always six 
essential constants. From (2.6) it can be seen that each body moves as if it 
were attracted by a fixed centre having a suitable mass (for calculating the 
mass one needs to know the form of the function /(r)). 

So far we have dwelt on a rather elementary and well-known subject, to 
show that the two-body problem can be reduced to the one-body problem 
only because the two bodies are subjected to a force depending only on the 
mutual distance. In the Lagrangian scheme this is expressed by saying that 
^c.m. corresponds to three ignorable coordinates.^ We can conclude that in 
the Newtonian theory of gravitation, the two-body problem is completely 
equivalent to considering the motion of a test particle attracted by a fixed 
centre. This result, unfortunately, is no longer true in the theory of general 
relativity, where the problem of one body and that of two bodies are deeply 
different. However, when problems regarding the solar system are considered, 
in most cases one can consider the Sun to be fixed (mass ~ cxd), its mass being 
much greater than any other mass involved. 



Kepler’s Laws 



Let us now consider a particle of mass m subjected to a central force directed 
from the particle itself to the origin of the coordinate system. The equation 
of motion will be 

V 

mr = —mf{r)-. (2.7a) 



Later on we shall specialize /(r) to represent Newton’s law; in that case it 
will be /(r) = /i/r^ , with /i > 0; in fact /x = GM. Equation (2.7a) can be 
rewritten as a system 



r — 




(2.7b) 



We have three degrees of freedom and then a six-dimensional phase space. If 
we vectorially multiply by r the second equation of (2.7b), we obtain rxv = 
0 and then 



— (r xv) = vxv-\-rxv = 0, 



See H. Goldstein: op. cit., p. 71. 



4 
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that is, the vector c = r x v (angular momentum of the unit mass) is a 
constant of the motion. From the definition, it is obvious also that c • r = 0. 
This means that: 

a) If c 7 ^ 0, since r must be orthogonal to c, the motion occurs in a plane 
orthogonal to c. In this case, the phase space is reduced to four dimensions. 

b) If c = 0, since 

d f r\ {r X r) X r {r x v) x r cx r 
dt\r) ^.3 ’ 

we also have rjr = const. , therefore the motion occurs on a straight line 
passing through the origin. 

In the first case, if we choose the plane of motion as the xy plane and introduce 
polar coordinates in this plane, then r = {rcosd, rsin-i?, 0), c= (0, 0, c), 
and therefore 

c = \c\ = \r X v\ — \r X r\ = d. (2.8) 

Considering that dS = | dd is the area swept out on the xy plane by 
rotation of the angle dd of the vector r, it turns out that c — const, represents 
the Kepler second law. 

If we now scalarly multiply the second of equations (2.7b) by v, we have 
V V — -f{r) r~^{r • v) = -f{r) r~Vr = -/(r) 

dt 

and then d{v^ / 2) — —f(r)dr. By integrating this, we obtain 

= h{'>') + h, 

where h is an integration constant and /i(r) = f{x)dx, where the choice 
for a must be such as to make the integral converge. When /(r) = 
a = oo if p > 1, a = 0 if p < 1, a = 1 if p = 1. In the Newtonian case, 
/(r) = /i = ptr~^ . For a particle of unit mass, fi = U{r) is the 

potential function and V{r) = -U (r) the potential energy. The relation 
just obtained can be rewritten as 

i + V(r) = h, (2.9a) 

with V{r) = — /i(r). Then (2.9a) represents the law of energy conservation 
and h is the total energy. Expressing by means of polar coordinates, (2.9a) 
becomes 

1 (^2 ^ 

and, by using (2.8), 

l^' + ^ + nr)=h. 



(2.9b) 
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We shall denote by Vef[{r) the quantity^ (? j 2 -f V{r). We can therefore 
rewrite the relations obtained in the form 

= c, Ves{r) — h. (2-10) 



The second of equations (2.10) is the equation of energy conservation of a 
system with one degree of freedom and potential function /7efr(r) = — 

We shall call the new potential, the effective potential and the term c^/2r^, 
the centrifugal termf 

The case r = ro = const, will correspond to a circular orbit and in that 
case h will coincide with a minimum or a maximum of (the orbit is 

said Lagrange-stable or Lagrange-unstable respectively). For Ve^{r) to have 
an extremum in ro, it must be the case that 



dVe^jr) 

dr 



r=ro 



dV (r) \ 

dr r^ J 



that is, 



dV{r) 



dr 



( 2 . 11 ) 



To have a stable orbit (Wff(^o) = Kff(^))? the following must hold 



(PVesjr) 

dr‘^ 



> 0 , 



r=ro 



that is. 



d^Vjr) 

dr‘^ 



r—ro 




> 0 . 



By using (2.11), we finally obtain 



d^Vjr) 

dr‘^ 



+ 



3 dV{r) 






dr 



> 0 . 



( 2 . 12 ) 



If the potential is given by a power law, V{r) = and with Ka > 0, 

(2.12) gives 

a{a-l)Kr^-^ + 3aKr^-^ > 0 , 

that is a > —2. 

If we now go back again to (2.10), we see from the first equation that d 
varies monotonically, since d is always positive and thus d is always increas- 
ing. From the second one, we get 

^ = -^ = ±V^2 {h - Feff) 



^ What we have done corresponds, in the Lagrangian formalism, to the construction 
of the Routh function in the case of an ignorable variable (d in our case). 

® In fact, it corresponds to a repulsive force oc 1/r^. 
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and, by integrating,^ 




dr 

Y^2 /i — 2 y (r) — jr^ 



Prom the first of equations (2.10), 



(2.13) 



c , c , ( di 
dt? = — dt = — dr I — 
dr 



± 



dr 



r2 ^2 



Then, we have, in addition to (2.13), 



cdr 



r‘^y/2{h— V{r)) — c^/r‘^ 



(2.14) 



If the motion occurs in a bounded region, then rmin ^ r < r^ax and, 
corresponding to these values for r, the denominators of the integrals in 
(2.13) and (2.14) will also vanish, which in this case means that r = 0 (points 
of inversion of the motion). In the interval of time in which r varies between 
^min and Tmax and goes back again to Tmin, the radius vector corresponding 
to the moving particle rotates by an angle Ai) (remember that d{t) is an ever 
increasing function) given by 



Ad = 2 



f 



cdr 

ry/2r‘^ {h — V{r)) — 



(2.15) 



A necessary and sufficient condition for the motion to correspond to a 
closed trajectory is that this angle is commensurable with 27 t, i.e. Ad = 
271171 jn with integer m and n. If this does not occur, the trajectory will “fill” 
the annulus Tmin < < ^max in an everywhere dense manner; this statement 

can be proved in a manner analogous to the case of the planar oscillator in 
Sect. 1.3. It is clear that the condition Ad = 2'Kmln will only exception- 
ally be fulfilled, while in general it will not. However, two remarkable cases 
exist which give rise to closed trajectories, while still remaining in the class 
of potentials P(r) ex ; these are the cases a = 2, a — —1. We defer to 
Sect. 2.3 the proof of the fact that among all the central forces only the men- 
tioned potentials give rise to closed trajectories (Bertrand’s theorem). For 
now we proceed to a direct verification. A few lines are sufficient to check 
that the case a = 2 (elastic potential) is nothing but the planar oscillator 
already studied. As to a = —1 (Newtonian or Coulombian potential), return- 
ing again to our previous definitions, V{r) = — /i(^) = —lir~^. Instead of 
demonstrating that in this case one has Ad = 27rm/n (with integer m, n), 
we shall directly obtain the equation of the orbit. 

Let us again consider system (2.7), which, in the Newtonian case, will be 



r = V, 



r 






(2.16) 



7 



See Footnote 2, Sect. 1.1. 
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with the first integrals 

, 1 .9 u 

c = r X V = const. , h = -r + —7- — const. (2.17) 

2 2r^ r 

The energy integral, in the form (2.10), enabled us to arrive at (2.13) but, 
unlike (l.A.lla), this is integrable in terms of elementary functions but we 
cannot obtain by inversion an explicit function r = r [t) . However, we can 
obtain an explicit relation between r and (the equation of the orbit): this 
will be done by showing that a further vector exists which stays constant 
during the motion. From the already used relation 

d f r\ cx r 

dt\r) 7*3 ’ 



by multiplying it by —fi and by substituting (2.16), we have 



d f r 

ntVr 



i) X c. 



By integrating this relation, we obtain 



/X ^ J = V X c, (2.18) 

where 

V X c r 

e = = const. (2.19) 

II r 

The constant vector e is called the Laplace— Runge—Lenz vector. In the case 
c = 0, r/r — — e, and then e lies on the line of motion and |e| = 1. 

When c ^ 0, r • c = 0, and one sees immediately that e • c = 0; then e 
lies in the plane of motion. By multiplying (2.18) scalarly by r, one obtains 

/x(e •r-hr) = r- i;xc = rx'i;-c = c- c = c^. 



that is, 

(? 

e • r + r = — 



(2.20) 



If e = 0, then r = (? / [i — const, and the motion is circular; from = 
j ,2 r = 0 and = c and one has for the velocity v = const. == 

fi/c and, from the second of equations (2.17), h = /2 — fi/r = — /x^/2c^. 

Therefore, for e = 0, we have a circular motion with negative energy. In the 
case e ^ 0 , let us call u the angle between e and the x axis and f = d — u 
(see Fig. 2.2). By substituting in (2.20), one obtains ercos/ + r = c‘ 
which we rewrite as 






r = 



1 + e cos f 



( 2 . 21 ) 



Equation (2.21) is the equation of the orbit (a conic section with a focus at 
O). The distance r of the moving point Q is minimum when cos / = 1, e > 0. 
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The vector e has a length equal to the eccentricity and points towards the 
pericentre (nearest point to the focus). The opposite point, the farthest one 
from the focus (which, obviously, exists only for closed orbits) is called the 
apocentre and the angle / the true anomaly. Pericentre and apocentre are 
called apses and the line joining them the apse line. In the apses, being 
r = 0, the velocity is orthogonal to the radius vector. From the theory of 
conic sections, we know that: 



1) for 0 < e < 1 

2) for e = 1 

3) for e > 1 



the curve is an ellipse, 
the curve is a parabola, 
the curve is a hyperbola. 



Obviously, the case e = 0 is the circular orbit already considered (see 
Fig. 2.3). In the figure, we have 



^0 



1 + e’ 



n 



1 -e 



^0 



1 + e 



1 — e’ 



As e is a constant vector, uj will also be constant, and then f = 'd. Since, 
as we know, > 0 , / will be ever increasing with time and then (2.21) 
represents Kepler's first law. 

So far, we have found, for the system (2.16), two vectors and one scalar 
which are constants of the motion: this means seven scalar constants. As 
the system is of order six, obviously the seven scalar constants cannot be 
independent: there must exist at least one relation among them. Actually 
there are two relations; one has been already seen: e • c = 0. To obtain the 
second one, let us square (2.18): 

e • r + 1^ = v^(? 



r 



and, since 
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= 2{h+ -), e • r = r, 

r /X 

we get 

-l) = 2hc^. (2.22a) 

We therefore have 7 — 2 = 5 first integrals and our original system of order 
6 can be considered reduced to a quadrature. From (2.22a), we can further 
infer that: 

1) e < 1 (elliptic motion) corresponds to /i < 0, 

2) e = 1 (parabolic motion) corresponds to h = 0, 

3) e > 1 (hyperbolic motion) corresponds to h > 0. 

For an ellipse the parameter (c^//x in our case) is given by (1 - e^) a , where 
a is the semi-major axis. From (2.22a), we therefore have 

c2 = a/i(l-e2) (2.22b) 



(the semi-major axis depends only on the total energy). Moreover, from (2.8) 
and (2.22b), we have for the area swept out by the radius vector in unit time 



dt 




V^/xa(l - e^). 



On the other hand the area of the ellipse will be S — n ab — ir o? \/l — 

(b = semi-minor axis); if we call P the period of the motion and n the mean 
motion defined by n = 27 t/P, then 



Therefore, 



from which 



dS S nab nab _ 1 



na^\/l — e^. 






= /i. 



This is the analytic expression of Kepler’s third law. 

For the sake of completeness, we now present another method (Binet’s 
method) for obtaining (2.21). From (2.8), r'^dd = cdt, and then: 



d c d d^ _ c d f c d\ 

dt dd ’ dP dd \r“^ dd ) 



For the radial acceleration, which, for a force — /(r) ^ , will be given in the 
plane of motion by ar — r — rd‘^ = —f{r) , we have 
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c d f c dr 
dd \ dd 

1 dr 
dd 

and setting = 1/r, we finally get 



= -/W- 



As 



/I 

V ^ 



2 2 
CU 



_d_ 

M 



-cV = -/fl 

dd J \u 



from which 

— 4 - = ^ f /^1 

V M 



(2.23) 



(the differential equation of the orbit in the variable u). Equation (2.23) is 
invariant under the exchange d ^ —d : therefore if we assume 'i? = 0 to be the 
position of one apse we have immediately that the orbit is symmetric with 
respect to the radius passing through the apse itself. Obviously this applies 
to all apses. In terms of potential, by using the function /i(r) == fi (l/u) 
introduced at the beginning and exploiting the relation 



d 

du 

we have, in place of (2.23), 

d?u 

dd‘^ 



dr d Id 

du dr dr ’ 



1 A 

du 






1 

u 



(2.24) 



For the Newtonian potential, /i(r) = fi/r — fiu, (2.24) becomes 

d^^u II 

dd‘^ ^ ’ 

which, integrated, gives u = A cos {d — do) + fi/c^ and finally 

1 

A cos {d — do) + fi/c^ ' 

If we take do = 0 and put Ac? j \i — e, we again find (2.21). 

Let us pause for a moment to weighing the results we have obtained. For- 
mally our problem was the integration of a system of order six: system (2.7). 
By leaving the function /(r) undetermined and by using only the general 
properties of the central potentials, we have reduced the order of the system 
by four. To go further, determining another first integral, we were obliged to 
specify the form of /(r) and, in particular, to put /(r) = /i/r^. This enabled 
us to attain, in an elegant way, the equation of the orbit without being really 
obliged to perform an integration. In fact, the introduction of the vector e 
provides us with the equation of the orbit, directly from the expression of the 
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first integral. That is, (2.20) is a relation r — r{'d) which, by defining the true 
anomaly, becomes (2.21), the polar equation of a conic. At this point, as the 
system is reduced to order one, it is by definition integrable and then formally 
solved. As a matter of fact, things are not so simple: it will not be possible to 
write r = r(t) and i9 = '0{t) explicitly, but we shall be obliged to introduce 
another variable, connected with r through trigonometric functions, whose 
expression as a function of time will be obtained only in an approximate 
form. 
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The results of the preceding section, even though obtained very simply and, 
in the case of the deduction of the equation of the orbit, also with speed and 
elegance, do not arise from the application of a general procedure but rather 
from intuitions. In every case the task was in fact to guess by what quantity 
a certain expression had to be multiplied in order that the time derivative of 
another expression (the first integral) would be equal to zero. Essentially, one 
guessed, in each case, the right integrating factor. Now, on the other hand, 
we want to see how these results can be obtained by applying the formal 
tools we have explained in the Parts B and C of Chap. 1. 

Let us begin with Noether’s theorem. To make things easier, let us assume 
that we have already demonstrated that the motion occurs in a plane and 
then consider the problem in the plane. The Lagrangian of the Kepler problem 



C = 



1 , 



Irl’ 



SO that, if r = (g'i,g'2,0), and r = (gi,g2,0). 












2 ’ 
^2 



(2.25) 



for a unit mass. Let us rewrite the generalized Killing equations (1.B.41) and 
(1.B.42) and the expression for the conserved quantity (1.B.48), confining 
ourselves to functions that do not depend on time, since the Lagrangian is 
like this: 



6 



dqi ^ dqi 
oqi 



d^i . . . dr 

-^qi -qm-^ 

oqi oqi 



dqi 



dqi 



-qi 



dr 

dqi 



T I r ■ 

I = £ - — 

dqi 



r— ^ 

dqi dq. 

dq 

dqi 



qi: 



= i = l,2. 



(2.26a) 



(2.26b) 



ii-f- 



It is easy to check that the transformation defined by 

r = -1, = 0 (Vi), / = 0, 



(2.27) 
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corresponds to % = Qi dCjdqi —C — const. , i.e to the conservation of energy. 
For the spatial transformation with constant coefficients 

T — 0 , ~ ^ij Qj 7 

one has from equation (2.26a), with / = 0, 

{qI + Qj + Qi Qj = 0 

and then ujij = —ooji {oJn = 0J22 = 0 , lui2 — —uj2i)- As CJ12 is an arbitrary 
constant, we can put it equal to —1 and obtain at once 

I = qiq 2 -q 2 qi= c, 

that is, the conservation of angular momentum. 

If we now try to apply the spatial transformation depending on the ve- 
locity, 

r = 0, = aij Qj, {aij const. V i, j), 

we again obtain, with f — L and an = a 22 = 1, «i 2 = 0^21 = 0, the 
conservation of energy (we omit the calculations for the sake of brevity) . In 
this case we also have new evidence that the transformation corresponding 
to a given first integral is not unique. In fact, if the transformation is 



^2=^2, / = >C, 

by applying (1.B.54) with r = —1, we get 

^ T = -1, 

f = f_+Cr = 0, 

6 =ii +qir = 0, 

. 6 = ^2 + hr = 0, 

which is just the transformation considered above. If we now want to de- 
termine the transformation (or the transformations) corresponding to the 
conservation of the Laplace-Runge-Lenz vector, in this case the integral no 
longer being quadratic only in the q and g, we must refer to a more com- 
plicated transformation. As in the case of the anisotropic planar oscillator^ 
with (jU 2 = 2 uji considered in Sect. 1.7, let us study the transformation 

= Aijk qj qk, r = 0, {Aijk const. V i, j, k). 

Equations (2.26b) give 



Am qi qi + An2 q2 h + A211 qi q2 + A212 q2 h = 






A121 qi h + A122 q2 h + A221 qi h + A222 ^2 h — 






^ The analogies and the interdependencies between Keplerian motion and the pla- 
nar oscillator will be investigated at various places later in the book. 
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and, by integrating, we get 

2 (^111 Qi + ^112 Q2) Qi + (^211 Qi + ^212 Q2) Q1Q2 + 9i {qi^Q2, 92) = /, 

- {A221 9i + ^222 Q2) Q2 + (^121^1 + ^122^2) 91Q2 + 92 {91^ 92^ 9i) = 

^1,^2 being arbitrary functions. The last two relations can be satisfied by 
^211 = -^121, ^212 = ^122; hence / is of the form 

2 (^111^1 + ^112^2) ^1 + 2 (^^221^1 + A22292) 92 + (^211^1 + ^212^2) 9 i 92 +9, 

where 9 = 9(91^92) is a new function to be determined. By substituting in 
( 2 . 26 a), we have 

df . df . 89 . 89 . 1 .3 1 . .2 

+ A21l9l92 + -Aii29l92 + -A 22292 + ^2129192 

89 . 8g . 1 . .3 1 . .3 

+ ^^212 + 2^221^ 91^2 + ^^211 + -Au2 ^ 9 i 92 

= -^[Aiiiq\qi + Aii2qiq2qi + ^ 121 ^ 1^2 + ^i22<?i^2^2 

+ ^ 211^1 ^ 2^1 + ^2129291 + ^22\9i9292 + ^ 222 ^ 2 ^ 2 ] 

+ A\ii9\ + (-A 121 + ^ 112 )^ 1^2 T A\229i92 + A2ii9i92 
+ A221919I + ^2129192 + A22292' 

For this to be meaningful, it must be the case that 

^111 = ^222 = O5 ^221 = ~ 2 yli 22 , ^112 = — 2 ^ 121 - 

Then 

A212 = ^122 = A^ A211 — A\2i — 

A221 — — 2A, A\i 2 — —2B, Am = ^222 = 0, 
where A and B are to be determined. One obtains 

^ {Bqm - Aql), ^ ^ {Aq,q2 -Bql), 

and then g = —^{Aqi B q2), where we have set the integration constant 
to zero. Finally, 

f = -Bq2ql~ Aqiql + {Bqi +Aq2)qiq2 - ^{Aqi + Bq2). 

The generators of the transformation will be 

= Aq 292 + B {9192 — 2 g 2 ^i), ^2 = ^ (92^1 — ‘^9192) + Bqiqi. 
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Clearly, one of the two constants can be taken equal to zero, and therefore 
two choices are possible: .0 = 0, A ^ 0 and = 0, .0 7 ^ 0. Correspondingly, 
one will have two a priori different first integrals. Let us look at the first case: 
.0 = 0. Here 

f = -Aqiq^ + Aq2qiq2- An^, 6=^9292, 6 = “^(c + 92 9i)- 

The first integral turns out to be Ii = A{—cq 2 + // ^) . By choosing A = 
— \j [I ^ one obtains 

r c . qx 

h = -Q2 = ei, 

fi r 

that is, the component along the qi axis of the Laplace-Runge-Lenz vector. 
The second case, A = 0, will give 

f ^ -Bq 2 qj + Bqiqiq 2 - Bfi—, 6 = 5 ( 0 - 9192 ), 6 = -69191 

r 



and the first integral 
obtains 



I 2 = B{cqi + By choosing B = — 1/)U, one 



r c . q2 

h = qi = 62 , 

/i r 



that is, the component along the q 2 axis of the Laplace-Runge-Lenz vector. 
The conservation of the Laplace-Runge-Lenz vector corresponds then to the 
transformations having as their generators: 



^1 = 

.1 1 . c 

C 2 = -92^1 + 

fl fl 



^2 1 . C 

Cl = -qiQ2 - 

/i /X 

= 9i9i- 



(2.29) 



They are then “rotations” in the qiq 2 plane (with coefficients depending on 
velocity) and translations along the axes with constant coefficients.^ To sum 
up, through Noether’s theorem we have obtained all the first integrals of the 
Kepler problem. Obviously, concerning the real applicability of the theorem 
the remarks already made in Sect. 1.7 remain sound. Furthermore, one might 
object that, after all, the same result has already been obtained in a simpler 
way, though not in the context of a general procedure, and then the use 
of a general formalism even becomes unnecessary. The obvious answer is 
that cases more complex than the simple Kepler problem (actually not so 
simple) exist and, therefore, general methods help us when intuition cannot. 
Moreover, in simple cases also, Noether’s theorem enables us to penetrate 
better the nature of the problem. 

Let us now move on to apply, still to the problem of Keplerian motion, 
another general method, the method of integration of the Hamilton- Jacobi 



^ In this case, a direct geometric interpretation in the space of the qi and as in 
the case of the oscillator, does not exist. 
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equation by separation of the variables. We have already seen, in Sect. 1.15, 
this method as applied to the anisotropic planar oscillator (with uj 2 = 2cui) 
in parabolic coordinates. Let us now define, in contrast to that case, the 
parabolic coordinates A, z/ in the following way^^ 



q2 = 2Xiy, 



- ^Iql^ql = (2.30) 



In this case, the coordinate lines consist of two families of mutually orthogonal 
parabolas, with the axis coinciding with the qi axis. 

By substituting (2.30), the Lagrangian (2.25) becomes 



£ = 2 (A2 + 1/2) + 



A^ + 



From this, one obtains the momenta 



PA = 4 (A^ + A, Pi. = 4 (A2 + i/2) 



and the Hamiltonian 



H = pxX^PjyU - C = 

The Hamilton- Jacobi equation is 

8 (A2 + i/2) A 5A / \ 



1 p\ + pI _ P 

8 A^ + A^ -h ’ 



A^ -f- 



where the first integration constant = /i is the total energy. Equation 
(2.34) can then be written in the form 



lUdS^ 
8 V ^A 



+ -fi = aiX^ + ally‘s 



and can be satisfied by putting 



1 



8 V dX 



1 2 ^fds:y 1 2 



where 02 is a new arbitrary constant. Then we have for a 2 

0^2 = ^P\ -\P- ^ ^ 

Taking the half sum of the two different expressions, 

Q!2 = ^ (pI -pI) + - A^), 

and substituting a\ —% (with T-L provided by (2.33) as an explicit function 
of the canonical variables), we finally get 



^°The reason for this choice will become clear in Sects. 2.6 and 5.4 
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1 1 ~ 

8 A2 + 1.2 2^ A2 + I.2 



(2.35) 



If we now substitute (2.32) and (2.30) into (2.35), we find after some calcu- 
lations that 

1 

= “2 

In this case, therefore, the first integral connected to the constancy of e has 
been obtained in a quite automatic way, by separating the variables in the 
Hamilton- Jacobi equation. No particular search has been carried out, no ad 
hoc position has been adopted. 

Lastly, if we ask ourselves what is the conclusion that can be drawn from 
the application of Liouville’s theorem (Sect. 1.18), the answer is immedi- 
ate. Going back again to the three-dimensional formulation, if we denote 
by Qi^Q 2 ^Q 3 the components of r and pi, p2? Ps the components of r, the 
Hamiltonian for unit mass will be 

^ = \{P\ +pI+ pI) - m /\/ QI + Q2+ 93 - 

and the components of the angular momentum 



Cl = ^2^3 - ^3^2, C2 = qsPi - qiPs, C3 = qip2- q2Pi. 

We have already seen in Sect. 2.1 that H and c are constants of the motion. 
It is easy to check that for = cf -h c^ + C3 one has 

{c^,n)^0, {ci,n) = 0, (c2, Ci) = 0, i = 1,2,3; 

therefore, if we choose any one of the components of c, it represents together 
with H and a tern of independent integrals in involution. According to 
Lie’s theorem, this corresponds to the existence of six first integrals and then 
the system (of order six) is completely integrable. 



2.3 Bertrand’s Theorem and Related Questions 

Let us now tackle the problem of determining what are all the possible forms 
of central forces giving rise to closed orbits (i.e. Ad of equation (2.15) com- 
mensurable with 27 t). The question was solved by J. Bertrand in 1873.^^ 
To demonstrate Bertrand’s theorem, we shall make use of a more restrictive 
definition of stability than the already mentioned Lagrange-st ability. The dif- 
ference between the two definitions will be made clear later on. Now, we shall 
adopt the following point of view: all central forces, as we have seen, admit 
circular orbits (whether stable or unstable); if we slightly perturb a circular 

Bertrand: Theoreme relatif au mouvement d’un point attire vers un centre fixe, 
Comptes Rendus, LXXVII, 849-853 (1873). 
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orbit we can have a perturbed orbit, whether closed or not. It will be closed if 
the initial circular orbit is stable, open in the opposite case. Therefore closed 
orbits will be generated by those central forces which admit stable circular 
orbits. 

Starting from the equation of the radial motion, 



r — 



dV{r) 

dr 



and using the integral of areas = c, we have 



c2 _ dV{r) 
dr 



(2.36) 



(2.37) 



For a circular orbit, r = const. = tq, r = 0 and then (as we have already 
seen) 



dV{r) 



dr 



r=ro 



(2.38) 



If the circular orbit is perturbed, and writing x = r — ro, the perturbation, 
one obtains from (2.37) 



X - (^(x + ro) ^ 



dV{r) 



dr 



r=x-\-ro 



By expanding in series in the small quantity a:, we get 



1 -3— + ... 1 - 
^0 



dV{r) 



dr 



d‘^V{r) 



dr‘^ 



X + 



Finally, by using (2.38), we obtain 
dV{r) 



X + I 3 



dr 



d‘^V{r) 
ro dr'^ 



+ . . . X = 0, 



(2.39) 



where we have neglected all the terms of higher order than the first. 

Now, if the term in parenthesis in (2.39) is positive, then (2.39) is the 
equation of a harmonic oscillation, and this means that the perturbation 
oscillates around tq: we say that this condition must be satisfied in order 
that the circular orbit be stable (see Fig. 2.4). 

Actually, the condition SV'{ro)/ro + V"{ro) > 0 is the necessary and 
sufficient condition, as we have already seen, for the existence of a minimum 
of the potential Vef[{r) in r — ro (i.e. Lagrange-stability) . Now we consider 
this condition necessary only. For the circular orbit to be stable, it is thus 
pecessary that the term considered is positive and so corresponds to the 
squared frequency of the oscillations of the perturbation around tq: 



> 0 . 

ro 



(2.40) 
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Let us now move on to deal directly with the orbit. For an almost circular 
orbit, we have seen that r oscillates around the value vq (if the orbit is 
stable). From (2.40), the period of these oscillations is 



27T 27T 



(2.41) 



Let us evaluate the apsidal angle, that is the variation of the polar angle 'd 
during the time in which r oscillates from a minimum to the next maximum. 
The time is given, obviously, by l/2r ; to evaluate the angle swept out in this 
time we shall have to multiply by a mean angular velocity. Since = c/r^ 
and r oscillates around rg, we shall assume the mean angular velocity to be 






^0 



because of (2.38). The apsidal angle will therefore be 



1 n 1 T 7T 

— A'd 2:^ — T'd = — 



V'{ro) 

ro 



7T 



V^^(ro) 

roV"{ro) + 3V'{ro)' 



(2.42) 



Within certain limits, the apsidal angle will not depend on the particular 
chosen value rg. In fact, in the case of closed orbits the number of oscillations 
must be the same at any distance where closed orbits are permitted; otherwise 
one would have discontinuity in the number of oscillations when rg varies, and 
corresponding to this discontinuity the orbit could not be closed. Therefore 
we shall impose the condition that the quantity 

rV"{r) + 3V'{r) 

write r for the variable rg. 
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be a positive constant. Introducing the constant a > —2, we express this 
condition in the form 



rV"{r)-^3V'{r) 



1 

(2 + a) 



>0. 



(2.43) 



This condition is the necessary and sufficient condition we wanted for the 
stability of the circular orbit. At variance with Lagrange’s criterion, which 
only refers to the behaviour of (see (2.12)), now the polar angle is 

also involved (see (2.42)); therefore the conditions to be fulfilled are more 
stringent. By integrating (2.43), we have two cases: 



V{r) = Ain r (when a = 0) , 
V{r) = (a / 0, > —2), 



with A and k being integration constants. The logarithmic potential cannot 
give rise to closed orbits, since in this case Ai9 = ttv^, and then the apsidal 
angle is not commensurable with 27 t. 

We are left with the cases of V{r) = kr^ with — 2 < a < 0 and o; > 0. 
To determine what are the allowed values for a in these two cases, let us 
try to obtain in another way an estimate of the integral coresponding to 
I A'd (Arnold) . The method consists in evaluating the integral for a limiting 
situation, where it is easily evaluable; since, as we have said, it must stay 
constant when r varies, the value obtained is valid in general. From (2.15), 
we have 

2 rA/2r2 {h - V{r)) - 

For the case a > 0, let us consider the limit for r oo \V{r) — kr^ 
will tend to oo for r -> oo; for the denominator to remain finite, h must also 
tend to oo, while the last term will tend to zero. It is evident from this that 
the constant k must be greater than zero. Let us now make the change of 
variable u — cjr^ du — —{c!r‘^)dr\ then 

1 ^ ^ du 

I — — -= , 



with I4ff('^) = V{c/u) + u‘^/2. Let us further set y — then 




(2.44) 



with 






■V 



y^max 



taking into account that, for r — 
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(j 1 

h = h V^(rmin) ~ 2 ^ 



2 

max 



+ V{c/ 

'^max)5 



therefore 



Kff(l) = 



/i 



For h ^ QO, limax 
negligible. Thus 



Therefore, for a > 0, 



oo, 2/rnin 0 and the second term in Ves{y) is 




dy 



7T 

2‘ 



Z\^ = 2 



7T 

\/2 T q; 



= 7T, 



and a = 2; consequently V{r) — kr‘^ ^ with A; > 0 (the case of the planar 
oscillator) . 

Let us now examine the other case: — 2 < a < 0. It is convenient to put 
(3 — —a. Then 

V{r) — kr~^ (0 < /? < 2). 



The limiting situation convenient in this case is that given by h — 0 ; 
correspondingly r^in will tend to a finite value and Tmax to -\-oo (there- 
fore Umin 0, Umax ^ finite quantity). Moreover one sees that k must be 
negative. We shall therefore write 



V{r) = —ixr with /i > 0. 



Reconsidering (2.44), we have in the limit 






/' 



dy 



\J-2Ves(y) 



The radical quantity is 



-2 









2/Uma 



-P- 



= 



2/^vfn 



— 2-1 



But in Tmin 7 since /i = 0 , + I^(^min) = 0 as well, that 

(c/rmin) = ~ 

Therefore our radical is simply reduced to — y‘^ ^ and for AiJ 

dy 7T 






VW 



2-/3' 



1 ^ A 7T 7T 

— Ad = — , = — — ■ , 

2 y/2 + OL \/2 — [3 



But also 
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and so — (3 = 2-/3 and /3 = 1. As a consequence V{r) = that 

is, the Newtonian potential. 

After having demonstrated that the only central forces giving rise to closed 
orbits derive from V{r) = kr‘^ and V{r) = , Bertrand also tackled the 

problem of singling out the forces for which the closed orbits were ellipses. 
The problem was solved at the same time by Darboux and Halphen,^^ who 
demonstrated that the potentials V{r) = and V{r) = —{i/r give elliptic 
closed orbits. Therefore the closed orbits can only be ellipses. The Newtonian 
potential, as we have seen, gives Ad = 2 tt, while for the planar oscillator we 
get Ad — 7T. The two different values are not in contradiction, since for the 
oscillator the radius vector originates from the centre of the ellipse while for 
the Newtonian potential the origin is one of the foci. 

If we add to the Newtonian potential a “small” term (perturbation), this 
will cause an increase or decrease of the apsidal angle. The more frequent 
case in the solar system is that where the gravitational perturbation of the 
motion of a planet due to the action of the other planets is represented by 
a force of the type ejr^ (e is a small quantity and such that powers higher 
than the first are neglected) Generally there will be a force 




The angle Ad in this case becomes 



Ad = 2tt\ 



I V'jro) 

roV"{ro) + 3V'{ro) 
_ 1 



= 2tt 



= 27T 



1 — ek 



-l-r 



[l + ek-^r^'^ \ 



27T 1 



3 + ro 



fcrg 



llMl' 

f{ro) . 



where the higher powers of e/kr^ have been neglected. Therefore, the apse 
advances or regresses according to whether e is positive or negative. In the 
well-known case of Mercury, there is an advance of the perihelion of 574" 
in a century; calculations of the type mentioned above (obviously, with a 
higher accuracy) account for an advance of 531" in a century. There remains 
a discrepancy of 43" which, as is known, can be accounted for by Einstein’s 
theory of general relativity. 



^^A demonstration, simplified with respect to the original one, can be found in 
P. Appel: Traite de Mecanique Rationnelle^ 3rd edn. Vol. 1 (Gauthier-Villars, 
1909), Sect. 232. 

^'^A force of this kind is obtained as a first approximation of the effect exerted by 
a thin ring of gravitating matter (Gauss ring) in which we can consider the mass 
of a perturbing planet to be distributed, so to speak, along its orbit. 
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2.4 The Position of the Point on the Orbit 



The Eccentric Anomaly 



As we have seen, c and e completely determine the orbit, and, given certain 
initial conditions vq and Vq^ we can obtain from them c and e and then 
the orbit. However, we are not able to provide an explicit expression for 
r = r(t), that is, to give the position on the orbit at a certain instant of time 
(for instance ti) of our point rt=ti = r{ti). For the solution of this problem, 
it is convenient to make a change of variable and to put t = t{u). The new 
variable u, on account of the way in which it will be defined later on, is called 
the eccentric anomaly. 

From the equation of energy conservation in (2.17) we obtain 



(r r)^ + = 2 ()U r -h /i r^) . 



(2.45a) 



Let us now introduce the variable u such that 



du 






dr 

r(r)’ 



where k and T are two constants. Moreover, 



r = 



dr . dr k 

du du r 




where the prime means derivation with respect to u. By substitution, (2.45a) 
becomes 



k‘^{r')^ + = 2 (/ir + /ir^) . (2.45b) 



a) h = 0. Let us deal first with the simplest case = 0, furthermore making 
the choice k^ — fi. Equation (2.45b) gives 




= 2r. 



and, by taking the derivative, we get 2 r'r" = 2r' . Since r' ^ 0 (otherwise r 
would be constant), r" = 1, and then r will have the form r = ^ {u — uq)‘^-{-A. 
By putting ?io = 0 and determining A by substituting in the differential 
equation, one obtains 



r 



1 

2 



> c^ 

U^ ^ 



From this and from the definition of u, which, with u = 0 for t = implies 
fj. kdt = rdu^ we get 






, 1 3 

du = - u -\ u. 

6 2/i 
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To summarize, 

= = + ( 2 - 16 ) 

In the second of equations (2.46), t is a strictly increasing function of u. The 
equation can be solved and one uniquely obtains u = u{t) . to the meaning 
of the constant T, one must as usual distinguish the two cases c 7^ 0 and 
c = 0. 

For cy^O, /i = 0, and the orbit is given by the parabola 



r = 



eV/i 

1 + COS / 



(2.47) 



Since rmin = c^/2/i (corresponding to / = 0) - and this is also the value of 
r corresponding to = 0 and then to t = T - we have that t = T is the time 
of pericentre passage. 

If, on the other hand, c = 0 , we have 

6y^{t-T) = u^, r = 



Therefore, at t = T there is a collision at the origin. If T > 0, the collision 
will occur after the instant t = 0 and, from that moment, the motion will 
change. If T < 0 , then the point will be “emitted” from the origin and the 
motion will occur for T < t < 00 . 

Returning again to the case c 7^ 0 , if we compare the first of equations 
(2.46) with (2.47), we get 



2 1 — cos / 

/X 1 + cos / ’ 



that is. 



c / 

u = -—tan -, 

V7^ 2 



(2.48) 



which is the relation that enables us to pass from the true anomaly to the 
eccentric anomaly. Then it is possible to rewrite (2.46) by putting / in place 
of u by means of (2.48). There are tables which enable us to calculate / easily, 
if we know (t — T) and c^/2/x (Barker’s tables). If instead we want to know the 
position by means of the eccentric anomaly u, we proceed as follows: given 
Vq and Vq (at the time t = 0), by differentiating the first of equations (2.46) 
one obtains r = uii = uk/r, so that rr = r r = r v = and then 

ro • Vo = UQy/]l\ using uq (at the time t = 0), one calculates T, and inserting 
it again into the second of equations (2.46) one has u(t) and then, from the 
first, r{t). At this point, if c == 0, the position is known since we know the 
line of the motion e — const. If c 7^ 0 , there are two possible values of /, 
since cos / = cos(— /). Thus / > 0 for t > T, / < 0 for t <T. 
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b) h ^ 0. There are three possible kinds of motion: 

1) linear, if c = 0, 

2) hyperbolic, if c 0, /i > 0, 

3) elliptic, if c / 0, < 0. 

Again consider (2.45b) and now make the choice k‘^ = /i/a, where the con- 
stant a — fi/2\h\ has been introduced. By dividing by one gets 

(r')^ H = 2ar a (h) , with cr(/i) = 

/i |a| 

By adding a (h) to both sides and remembering, from (2.22), that c^//i = 
a (e^ — 1) a (h), one has 

(r')^ + a^e^a (h) = a (h) [a + a{h) r]^ . 

By defining g (u) through ea g = a a (h) r, we finally obtain 



ie'f -cr{h) = -cr(/l). 



Apart from the solutions ^ = ±1, which do not interest us, this equation is 
satisfied by 

g = cosh{u Ki)^ h > 0, 
g = cos {u + K2), h <0. 

Since T has not yet been fixed, we put Ki = K2 = 0. As a, consequence 
g = cosh u and g = cos u in the two cases respectively. Therefore we obtain 



r = a (e cosh u — 1) , h > 0, (2.49) 

r — a{l — e cos a) , h < 0. (2.50) 





-T) = r,r 








a3/2 


and 






\/M 




a3/2 



for h > 0 



{t — T) — u — e sin u, for h < 0. 



In the last case (elliptic motion), = 27 t/P = n, where P is the 

period and n the mean motion: one defines n{t — T) = I {mean anomaly). 
Also for /z 7^ 0, from equations (2.21) and (2.22), one has 



a|e^ — 1| 

1 -h e cos / 



r = 
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From this, Tmin = a|l — e| and, since t = T for u — 0, one has that, for 
c 7 ^ 0 , T is the time of pericentre passage. On the other hand, for c = 0 one 
has e = 1 and then r = 0 for iz = 0: in this case T is the instant of collision 
at the origin or of emission from the origin. 

As to the relation between u and the true anomaly /, we have from (2.22) 
and (2.49) that 

1 — cosh u 1 — e 1 — cos / 

1 + cosh u 1 -h e 1 + cos / 

and then, for > 0, 

f ll + e u 

tan — — ~ -. (2.51) 

For < 0, from (2.22) and (2.50), we obtain 

/ /l + e u , , 

tan - = y YZri 2 ■ 




Fig. 2.5 



In the last case, it is useful also to look at the geometric meaning of the 
eccentric anomaly. 

Referring to Fig. 2.5, where the ellipse has the semi- axes a and b and the 
circle radius a, one has 

QH r sin / b 

SH a sin u a 

from which rsin/ = 6sinu. Moreover rcos/ = a(cosu — e). Squaring and 
adding yields = 5^ sin^ u + (acosu — ae)^, and as 6^ = a^(l — e^), 
(l — 2ecosu + cos^ u), that is, r = a (1 — ecosu) . This proves 
that u is really the angle drawn in Fig. 2.5. That is, given the point Q with 
true anomaly /, one determines the point S by drawing from Q the line per- 
pendicular to the semi-major axis until it intersects (on the side of increasing 
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/) the circle of radius a and centre C. The angle SCP is then the eccentric 
anomaly u. Therefore (2.50) and 

n{t — T) — l — u — esinu (/i < 0, e < 1) (2.53) 

can also be rewritten with the true anomaly instead of the eccentric anomaly. 

For the two cases /i 7^ 0 (hyperbolic and elliptic motions) one can obtain 
r = r{t) by proceeding along the same line of reasoning as for the case 
h, = 0. Now, instead of two equations (2.46), we have (2.49) and n{t — T) = 
e sink u — u for h > 0 and (2.50), (2.53) for /i < 0 . Unfortunately, in both 
cases the relation between u and t does not consist of an algebraic equation 
as is the case for /i = 0, so it is not possible to obtain the solution in terms 
of elementary functions. We shall give a short account of the problems raised 
by (2.53), known as Kepler’s equation. 



Kepler’s Equation 

Despite its seemingly simple form, (2.53) has intrigued mathematicians for 
a long time and its solution by means of successive approximations is still 
being investigated. In the search for a solution, there are two approaches: 
an analytical one, which, by exploiting the properties of the sine function, 
allows for series expansions, and a numerical one, which, through the several 
methods of solution of nonlinear equations, provides approximations with 
different degrees of convergence and accuracy. 

Before giving an outline of the problems encountered in adopting the 
first approach, let us establish that Kepler’s equation has one and only one 
real solution for any value of t and 1. Given (2.53), consider the function 
(p{u) — u — esinu — I . Taking into account that for I = mn ( m any integer) 
u = mn , consider a value of I between rriTr and (m + 1) tt . Then 

(f{m7r) = mn — / < 0, (f [(m + l)n] = (m + l)n — / > 0. 

As a consequence, (p{u) = 0 has an odd number of real roots between mn and 
(m + 1)7 t ; but, since (p\u) = 1 — ecosu > 0 and so (p{u) is ever increasing, 
one also infers that (p{u) passes through the value 0 only once. 

Let us turn now to the classical methods based on series expansions. In 
the case we have to consider, 0 < e < 1, to obtain an expression of the 
function u (e, 1) we can choose between two possibilities: 

1) one can expand, for any fixed positive value of the eccentricity (e < 1) , 
the deviation of the eccentric anomaly from the mean anomaly in a Fourier 
series in the variable /, with coefficients depending on the fixed value of 

e; 

2) the solution u = u{e^ 1) of the equation can be written, for any fixed 
value of the mean anomaly, as a Taylor series of powers of the variable e, 
with coefficients depending on the fixed value of 1. 
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In case (1), if we consider that the function e sin u = u{l) — I = F (1) is 
periodic with period 2tt in odd, tends to zero for / -> 0 and I tt, and 
has a continuous derivative, we can expand it in a sine series 

oo 

F (I) — u{l) - I = ^ Bi sinz/, 

2=1 



with 






It is fairly straightforward to show that 

Ji^e) 

i 



1 

- / F{l)smildl. 

^ Jo 

low that 

, Ji (ie) . . , 

= / + 2 2^ — ^ — ~ sin i /, 



2=1 



(2.54) 



where the Ji are Bessel functions of the first kind.^^ Owing to the regularity 
of the function F (/) , series (2.54) converges uniformly for any value of I in 
the interval (— oo, +oo). It is interesting to dwell for a moment upon the 
question of the rapidity of the convergence or, in other words, of the degree 
of accuracy we succeed in obtaining by means of expansion (2.54). Since in 
practical applications series (2.54) will obviously be truncated, the problem 
will consist in evaluating the error made: the more rapid the convergence, 
the lesser the number of terms necessary to have a good approximation. If 
we evaluate the first six Bessel functions^^ and substitute them in expansion 
(2.54), we get 



u — / T 
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+ 



1 3 1 5 1 

_ P'3 I 

8 192 9216 
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1 



sin I 
sin 2/ 



125 



3125 

9^ 



sin 5/ 



sin 31 F 



27 



1 



- TT + — e' 



15 



80 



243 



45 



sin 61 + 



sin 4:1 
(2.55) 



If we assemble the terms having the same power of e, (2.55) can be rewritten 
as 



^ / + e sin / + — sin 2/ + ( - sin 3/ - - sin I 

2 V 8 8 



+ e 



^ ^ ^ sin 41 sin 21 \ F e 



27 






384 



4 . 1 . 

— sm 4t + ^ sm 2i 



5/ 125 . 27 . 1 . , 

' sm 5/ — sm 31 F ttt sm / 



128 



192 



(2.56) 



^^See, for instance, W. M. Smart: Celestial Mechanics (Longman, Green and Co., 
London 1953), Chap. 3. 

^®See W.M. Smart: op. cit.. Appendix. 
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It must be remarked that, as the eccentricity is very close to zero in most 
problems to do with the solar system, an expansion of the sixth order like 
(2.56) is a fairly good solution. 

Let us now consider the second possibility, that is, to expand, for any fixed 
value of the mean anomaly /, the solution u = u{e, 1) of Kepler’s equation 
as a Taylor series of the eccentricity around e = 0, 

OO j 

u{e,l) = Y,c,{l)^^, (2.57) 

j=0 ^ ' 

with coefficients depending on / 



/ u (e, /) 

V de^ 



e=0 



(2.58) 



We shall see that the functions Cj (/) are trigonometric polynomials in the 
mean anomaly. The problems presented by expansion (2.57) are: 



a) to evaluate an explicit expression for the coefficients c^; 

b) to determine the radius of convergence of the power series. 



Let us begin from (a) by resorting to an expansion due to Lagrange. Consider 
the equation 

z — w-\-a^{z), (2.59) 

where the variables 2:, w indicate real or complex quantities and a is a pa- 
rameter. If 7 is a smooth closed curve such that at all its points 

<1, 

z — w 



let us assume that ^ (z) is holomorphic inside 7 and on the curve itself. It 
was shown by Lagrange that (2.59) admits one and only one root inside 7 
and that any function F (z) holomorphic in the same domain bounded by 7, 
having such a 2: as a root, can be expanded in a power series of the parameter 
a, convergent in 7, as follows: 



a‘^ d 



F{z)=F {w) + {w) F'{w) + ^ ^ H F\w)] 









that is 



a'^ d 



F (.) = F (u.) + ^ ^ ^ {F'{w ) !«. (»)]"}. 



n—1 



If, in particular, we choose F (z) = z, then 

z = w+y — r[^(w;)r 



(2.60) 



(2.61) 



(2.62) 
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Equation (2.53), rewritten in the form 

= / -h e sin u, 

is a relation of type (2.62). Therefore, putting F {z) = z = u, ^{z) — 
sin u, w — I ^ a — e and demanding that the functions fulfil the required 
conditions, we have 

OO j jj — ^ 

u = l + J2j — (sin ly . (2.63) 

Hence the coefficients Cj (/) have the form 

Cj (/) = Y sin-^ I, j = 1, 2, . . . ; cq (1) = I, 

but, since the j-th power of sin /, because of de Moivre’s formula, is a linear 
combination of 1, cos/, cos 2/, . . . , cos jl or sin/, sin 2/, . . . , sin j7 (for even 
or odd j respectively), the {j — l)-th derivative of sin / is a linear combination 
of only sines; therefore we end up with 

As a consequence, the first terms in the expansion of the solution of Kepler’s 
equation will be 

u =: I e sin I F — sin 2 / + • • • . (2.65) 

It must be remarked, however, that the terms of higher degree are consider- 
ably more complicated to evaluate, in comparison with those written above, 
and the difficulties in the calculations increase rapidly. On the other hand, 
we usually have to deal with very small eccentricities, so that even the three 
terms we have written provide us with an accurate value of the eccentric 
anomaly. When this is not so, an option of great practical value may be that 
of resorting to graphical methods to solve the equation by determining the in- 
tersection of the sinusoid y = sin u with the line of the equation y = {u-l)/e 
in the uy plane. If we now go back again to (2.65) and compare it with (2.56), 
we see that the two expressions practically coincide except for the fact that 
the coefficients of the expansion depend in one case on the fixed value of the 
eccentricity and, in the other, on the fixed value of the mean anomaly. 

The formal identity between the two expansions was pointed out by La- 
grange but his result is, however, of limited validity since it is possible to 
deduce that the Lagrange series converges on the whole of the real axis (that 
is, for any value of / on the real axis) only for small values of the eccentricity. 
In fact, a careful analysis of the Fourier expansion, by exploiting the prop- 
erties of Bessel functions, shows that the series converges, for any real value 
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of the mean anomaly, only if e < 0.6627434 .... These conclusions were the 
outcome of a long period of research: in fact, though Lagrange had already 
derived expansion (2.65), he did not come to a rigorous conclusion about the 
convergence. Several decades later, Laplace also tackled the problem but a 
rigorous proof was given only by Cauchy; finally, Rouche simplified Cauchy’s 
demonstration remarkably. It must be emphasized that the invaluable contri- 
bution to the development of the theory of complex functions due to Cauchy 
originated just in the task of solving the problems of the convergence of ex- 
pansion (2.65): in fact, his fundamental discovery connecting the radius of 
convergence with the position of the nearest singularity was made when he 
was studying (2.65). 

It remains now to speak about the methods of solution involving succes- 
sive approximations. In the last three decades, Kepler’s equation has roused 
new interest in consequence of the requirements of space dynamics. At the 
same time, the increased power of computers has made people give up the 
analytical solutions. In fact, direct analytical solutions are rather inefficient 
from the computational point of view and, in particular, power series expan- 
sions are rather troublesome to calculate; the consequence is that iterative 
methods have practically supplanted the classical analytical ones. In spite of 
this, in the past a very large number of methods for the solution of Kepler’s 
equation were proposed. In 1914 Moulton wrote: 

A very large number of analytical and graphical solutions have been discovered, 
nearly every prominent mathematician from Newton until the middle of the last 
century having given the subject more or less attention. A bibliography containing 
references to 123 papers on Kepler’s equation is given in the Bulletin Astronomique, 
January 1900, and even this extended list is incomplete. 

Among the various methods suggested for the solution of nonlinear equa- 
tions of the type / (i^) = 0, where / is given and one wants to find u, almost 
certainly the most famous is Newton’s method, which has a quadratic con- 
vergence property and is also known as the Newton-Raphson method. This 
has been the method traditionally favoured among those making use of suc- 
cessive approximations to solve nonlinear equations. It is simple to apply, 
and, in the case of Kepler’s equation, the possibilities of divergence were 
practically negligible, and local quadratic convergence was undoubtedly sat- 
isfactory. However, nowadays there are more efficient algorithms which ensure 
higher-order convergence without extra cost; the secant method in particu- 
lar may become less expensive. Anyhow, Newton’s method remains the best 
starting point for the systematic development of higher-order algorithms. 

We cannot go into the details of the various iterative methods of solution 
of Kepler’s equation here: it is a specialized and technical subject for which 
we refer the reader to the papers mentioned in the notes to this section. We 
confine ourselves to mentioning some general distinctive features. To solve an 

R. Moulton: op. cit., p. 190. 
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equation by means of an iterative method means to determine the limit of 
successive approximations constructed by making use of a uniform (iterative) 
procedure. One of the advantages of the use of these procedures is that they 
are simple and easily applicable when one uses computers. On the negative 
side, they present considerable difficulties for instance in the choice of the 
starting point of the iteration or with respect to convergence. It must indeed 
be said that the convergence of the procedure may become so slow that a 
satisfactory approximate solution cannot be obtained. It is for this reason 
that such methods have a limited range of applicability. Also it must not be 
forgotten that numerical methods are in general affected by errors (errors 
in the initial data, truncation errors, round-off errors). In solving Kepler’s 
equation with numerical procedures, therefore, one will have to take into 
account that the solution is the result of successive approximations, and as 
such it will have a degree of accuracy that depends unavoidably on several 
factors: on the order of the method, on the number of iterations and on the 
choice of the starting approximation. 



2.5 The Elements of the Orbit 

The Motion of the Point in Space 

So far we have been dealing with the determination of the orbit (its geometric 
shape and size), its orientation in space and the position of the point on the 
orbit itself: c, e and h fix the geometric shape and the size and the plane of 
the orbit is the one orthogonal to c; moreover through u one can determine 
the position of the point on the orbit. In the previous section, we argued 
about the determination of u as if vq and vq came out automatically from 
the observations. As a matter of fact, this is not a trivial problem, because Vq 
and Vo are not directly measurable but have to be obtained indirectly from 
the measurement of other quantities. We cannot extend our exposition by 
explaining the relevant procedure and so refer the reader to a textbook on 
theoretical astronomy. In any case, this does not change the substance of 
our conclusions: we started with system (2.16) to determine r = r (t, Vq, vq) 
and, instead of finding ourselves with the desired result, we have determined 
quantities, called elements of the orbit, connected with the first integrals 
found, which do not exhibit any simple relation to the components of r{t). 
On the other hand, it is clear that the elements are the quantities more im- 
mediately susceptible to measurement by means of astronomical instruments. 
Therefore the question is raised whether or not the Newtonian schematiza- 
tion of the problem (system (2.16)) is well suited to representing the features 
of the orbital motion. 

^®See also F. R. Moulton: op. cit., Chaps. V, VI. 
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Before answering this question, let us draw up a list of the orbital el- 
ements, which will also be useful for subsequent developments. Figure 2.6 
refers to an elliptic orbit of a generic planet (different from the Earth) of the 
solar system. In it we have the following 

S = Sun, 

P = pericentre (perihelion), 

A = apocentre (aphelion), 

CQDQ' = orbital plane, 

Ef2E'f2' = plane of the ecliptic, 

ECE'D = circle representing the celestial sphere, 

= line of the nodes, 

7 = vernal equinox. 

The orbital elements are: 

i = inclination, 

i? = longitude of the ascending node, 
e = eccentricity, 
a = semi-major axis, 
uj = argument of the perihelion, 

T = time of perihelion passage. 

Alternatively, one can use uJ = Q lo (longitude of perihelion). 




Fig. 2.6 
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Delaunay’s Elements 

We commented above on the inadequacy of the quantities of the Newtonian 
scheme which enter at the beginning in the setting of the 

problem, in representing the solution of the problem itself. Furthermore, if 
we take into account that in any real problem (for instance, in the solar 
system) one must consider the variations (even if small) to which the orbit 
is subject in the course of time, it stands to reason that the variables of 
Newtonian mechanics are less suited for a precise treatment. In fact, the 
study of the variations of the coordinates x^y^z (quite destitute of regularity) 
is not the best way to understand how the orbit of a planet is changing as 
a consequence of the presence (not always negligible) of the other planets. 
Therefore we need to have, as elements of the orbit, quantities that are at 
the same time dynamical variables or, vice versa, to write the equations of 
motion directly for the elements of the orbit. This is possible, as we shall see, 
in the Hamilton- Jacobi theory; the fact that the equations of motion give 
the time derivative of the elements of the orbit directly will then allow the 
formulation of canonical perturbation theory. 

Let us now consider, as usual, an inertial system in which x^y^z are the 
coordinates of our unit mass point and Px = x^ Py — Pz — ^ the com- 
ponents of its momentum. The six quantities {x^ y^ z\ Px^ Py^> Pz) constitute 
the set of canonical variables describing our system. By means of a canonical 
transformation^^, we move on now to another set of canonical variables given 
by the spherical polar coordinates p defined by 

X — r sin'll cos 
y = r sini? sinc^, 
z — r cos 'd, 



and their conjugate momenta 

Pr — Pi? = '^5 Pif = sin^ p. 

The Hamiltonian for Keplerian motion will be 






and the relevant Hamilton- Jacobi equation 



pI 






dr J 



1 



dSV 1 fdSV 
1 + ^ ^ + 

Xov/ sm‘ 



2 Gin^ ^ \dp J 









= Ctr — 



( 2 . 66 ) 



(2.67) 



^^The treatment that follows will be repeated in more general terms, that is, for 
a general central potential, in Chap. 5. For these reasons, we do not dwell upon 
the details of the demonstrations. The reader can obtain all the missing proofs 
by putting ^(r) = —y/r in the treatment of Sects. 5.2, 5.3. 
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since 

Pr = -^, = 5 = 5(r,i?,(^; J^, J^, J,^). (2.68) 

If we put S = Sr {r) -h Si^ {'d) + (c/p), (2.67) splits into three ordinary 

differential equations: 



dS^ 

dd 



dSr 

dr 



2 

+ 



dS^ 

d(f 

sin| 



a, 






(f 2 



q;: 



~‘^=2h. 



(2.69) 



Immediately we see that a^p = sin^ d (p , which is in fact the compo- 

nent of the angular momentum relative to the polar axis (t? = 0). Moreover, as 

(_^2 p _ ^2 ^ g-j^ p ^Qg p ^Qg ^^2 p ^Qg _ -^-2 ^ g-j^ ^ ^Qg ^ g-j^ 

are the other two components of the angular momentum, from (2.68) and 
(2.69) one obtains 



= \ vl + 



P% 



sm 



d 



= yj (r^ dy -h (r^ sin d pY = 



r X r 



that is, the modulus of the angular momentum 

— c = >//xa (1 — e^). 



Therefore the vector rxr = c has constant length and direction, and the orbit 
is planar and lies in a plane orthogonal to the total angular momentum. If i is 
the angle between the plane of the orbit and the xy plane, then cos i 

{i is the inclination). Hence, the three canonical constants in our case are 



c = yj fjia{l — e^), (2.70) 

cos i = yj fia{l — e^) cos i. 



OLr — 
= 

Oiipt 



For the elliptic Keplerian orbit, (/p is a cyclic coordinate, and therefore there 
is a rotation. As regards r and d^ which oscillate between a minimum and 
a maximum, there is libration. The limits for r are Vq = a (1 — e) and 
n — a{l e), and for d the limits are tt/2 ± i. 

To evaluate the action integrals 






/ 



pad'll), 




(2.71) 
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we have to substitute 



= \/ 2 ^ + ^ ~ 



r - y - sin2 ^ ’ 

obtained by (2.68), (2.69). The result is (see Sects. 5.2, 5.3) 

Jr — ^'d \/ M J'& — 

and, recalling that h = —/i/2 a, we get 






J 



h = -- 









2 {Jr + Q^t?) 2 {Jr + + J(^) 

If we calculate the three frequencies cur^ ^^9, (^ip^ 



(2.72) 



(2.73) 



(2.74) 



LUr = 



dh 

Wr 



= 



dh 

dJi 



00^ 



dh 



we obtain 



UJr — — 






(2.75) 

( + J^ + J^f 

The system is therefore completely degenerate (as one would expect, since 
the motion is simply periodic). Instead of calculating the angle variables 
conjugated to J^, J^, Jp , we first perform a canonical transformation to 
new action-angle variables Ji, J2, J3 and 0i, 62^ Os in order to get only one 
frequency uui ^ 0 and o;2 = ^3 — 0 (that is, O2 and 6s will be constant). 
The transformation will be generated by the function 



Ws — — 9 \ {Jr + + Jp) — 62 {J-d + Jp) ~ Os Jp 

and will give 

0 1 = Or, 

02 = 0 ^ — Or, 

0s = Op - 0^, 

From (2.70), (2.72-74) and (2.77), we obtain 



J\ — Jr J'd Jp") 
J 2 — J 4 ” Jpt 

J3 ~ Jp-> 



(2.76) 



(2.77a) 

(2.77b) 

(2.77c) 



Ji = ^/]Ta, J2 == = y//ia (1 - e^), cosi, (2.78) 

1 /i2 -2 






jr 



Pr - \l2h{Ji) + 



= \di- 



Jl_ 

• 2 ’ 
sm (/? 



(2.79) 

Pp = ^3- (2.80) 
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Calculating ^ 2 , ^ 3 , we get 




For the first integral, we have from (2.80), (2.79) 




(2.81) 



By substituting (2.50), (2.77), (2.79), we finally obtain 

9\ — J (1 — e cos u) du — u — e sinu = n {t — T) = 1. 

Thus the first of the angle variables is the mean anomaly: 9i — 1. For 92 
and 9s, the calculations are a little more complex and we refer the reader to 
Sect. 5.2, confining ourselves here to giving only the result: 

92 — — cj, 9s — 



therefore 9s is the longitude of the ascending node and 92 the argument of 
the pericentre. Collecting together the results obtained, we have 

= n (t — T) = /, Ji = y/TTd = L, 

02 = ^ - = 0, J2 = \//xa(l - e^) = G, (2.82) 

03 = i? = h, 03 = \//ia (1 — e^) cos i = H. 



The six simbols g^h^L^G^H were introduced by Delaunay and are 
named after him. According to what we have said in Sect. 1.17, the Delaunay 
elements (2.82) are determined uniquely up to a linear transformation with 
integer coefficients; then any linear combination with integer coefficients of 
these elements again gives a set of action-angle variables. Five of the six el- 
ements (2.82) are obviously constant and characterize the orbit; let us now 
write the canonical equation for the non-constant element. 

The Hamiltonian will be 



and the equation 



1 _ I g? _ 1 g 

2 jf ^ “ 2 Z 2 ^ ~2 a 



dl _dn _ g^ _ g^ 
dt~ dL~ L^~ {gaf/‘^' 



(2.83) 



(2.84a) 
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that is, n = ; squaring this yields 

— (2.84b) 

Therefore, the canonical equation (2.84a) is Kepler’s third law (2.84b), and 
so the close connection between the Delaunay elements and the dynamical 
problem clearly appears. 



2.6 The Problem of Regularization 



In the last hundred years the regularization (of the equations of motion) has 
been studied and obtained with motivations that have changed as time has 
gone on. The initial demand for demonstrating the existence of solutions for 
the equations of motion, represented by convergent series expansions, has 
altered little by little in the quest for formulae to be used in the best way 
in modern computers. The fundamental research which focused the problem 
goes back to the studies of Levi-Civita (1903 and subsequents) and Sundman 
(1907, 1912) on the regularization of the three-body problem. 

In recent years a method has been introduced (K-S transformation) by 
Kustaanheimo and Stiefel (1964, 1965), which extends in a four-dimensional 
parametric space the Levi-Civita transformation for the planar case. The 
method of the K-S matrix enables one to obtain great accuracy in those 
numerical calculations concerned mainly with the motion of satellites. In this 
section we shall deal exclusively with the two-body problem; therefore the 
aim is that of attaining a more suitable form (for numerical calculations) of 
the equations of motion. If we rewrite the relevant system as follows: 

r = v, v = (2.85) 

we see that it has a singularity at the origin r = 0; in other words, at this 
point the Newtonian force becomes infinitely great. Even if, in practice, it 
never occurs that one reaches r = 0 since the real bodies (both natural 
and artificial ones) have finite size, it is nonetheless necessary to eliminate 
the singularities so as to make the numerical integrations feasible without 
difficulty. This is particularly important for the motion of artificial satellites 
when they are “very close” to the centre of attraction, whether on leaving or 
on arriving. The same can occur, in the case of highly eccentric orbits, for the 
pericentre passage. The procedure for eliminating the singularities from the 
equations of motion, or more generally from a system of differential equations, 
is called regularization. If we have, for instance, a differential equation 



dx 






^°See T. Levi-Civita: Questioni di Meccanica Classica e Relativista (Zanichelli, 
Bologna, 1924). 
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we say that it is “regular” in (xq, yo) if in an open domain enclosing (xq, yo) 
both / (x, y) and df/dy are continuous and bounded. In this case, the theo- 
rem of existence and uniqueness guarantees that a solution exists and is the 
only one satisfying a given initial condition. In the opposite case, the equa- 
tion is singular in (xq, yo) and, in general, one cannot guarantee the existence 
of the solution. The argument can be immediately extended to a system of 
differential equations. It is quite natural to ask if it is possible to eliminate 
the singularities from the equations in which one is interested by means of 
a change of variables, which enables one to transform the equation, or the 
system of equations, into a regular equation (or system). Sundman (1912) 
gave a positive answer in the case of the three-body problem. It must also be 
stressed that the “regularization” we are speaking of concerns the equations, 
not their solutions: in fact singular equations may even occur which have 
nonsingular solutions. 

As a first step in tackling the regularization of the two-body problem, we 
will show that the introduction of the eccentric anomaly is actually a proce- 
dure of regularization. If, for system (2.85), we introduce polar coordinates 
r, in the plane of motion, for the radial motion, the integral of areas and 
the integral of energy, we have respectively 



r - 


(2.86) 


r'^ d = c, 


(2.87) 


1 ,o n 1 


(2.88) 


By substituting (2.87) into (2.86), we get 




1 

co| to 

II 

1 


(2.89) 


If we define u (eccentric anomaly), as in Sect. 2.4, 




1'^ 

II 

^ 1^ 


(2.90) 


and substitute 

d _ d 




dt r du 





in (2.89) and (2.88), we obtain 



2\h\ 



J.2 ’ 






^ o ^ 

r 2 



where the prime means derivation with respect to u. Substituting the second 
equation into the first, yields 

^^See, for instance, E. L. Stiefel, G. Scheifele: Linear and Regular Celestial Mechan- 
ics (Springer, 1971), Sect. 4. 
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JL 

2\h\ 



a, 



(2.91) 



where we have taken into account that we are considering the case h < 0. 
Equation (2.91) is the regularized equation of motion (in comparison with 
the “singular” one (2.89)). It is easy to verify that, if we impose the initial 
conditions r(0) = a(l — e) , (i.e. for u = must be the pericentric distance) 
and r(0) =0 (in the apses the radial velocity vanishes), then from the general 
solution of (2.91) 

r — A cos u-\- B sin u-\- a 

we have immediately that r = a (1 — e cosu), which is (2.50). Therefore the 
eccentric anomaly -u is a regularizing variable for Keplerian motion. 



The Levi-Civita Transformation 



At this point, we can even aim at more general results, by considering 
system (2.85) expressed through rectangular coordinates in the plane of 
motion. From now, therefore, we shall consider that r = (xi, X 2 ), and 
V = f = (xi, X 2 )- If we also put dt = rds and denote by a prime derivation 
with respect to the new independent variable s (the “fictitious time”), the 
equation of motion and energy integral will be 



r" - - r' + /i- = 0, 
r r 



1 I P 

2 




(2.92) 

(2.93) 



It is immediately seen that neither is (2.92) regularized nor can it be regular- 
ized by substituting (2.93), since now (2.92) represents two scalar equations 
and the energy integral by itself is no longer equivalent to the equation of 
motion as in the case of the radial motion. Thus a further transformation is 
required. Levi-Civita obtained the regularization of (2.92) by means of the 
transformation 

xi = ul — ul, X 2 = 2i^it^2 (2.94a) 

from the vector r = (xi,X 2 ) to the vector u = {ui^U 2 )'f the X\X 2 plane is 
called the physical plane and the plane U 1 U 2 the parametric plane.^^ Equa- 
tions (2.94a) become particularly meaningful if we consider the relevant com- 
plex planes. In place of (2.94a), we have the (conformal) transformation 

z = Xi -\- ix 2 = {ui + iu 2 )‘^ = w^. (2.94b) 



^^Equations (2.94a) are the same relations as those we introduced in (2.30) to define 
parabolic coordinates. However, in this section, unlike what done in Sect. 2.2, we 
shall look at the curves in the u\U 2 plane corresponding to x\ = const., X 2 — 
const., and then they will be hyperbolas in the u\U 2 plane. 
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We can immediately check that (2.94b) is a conformal^^ transformation by 
ascertaining that the Cauchy-Riemann conditions 

dxi dx2 dxi dx2 

a — « — ’ a — = 

OUl OU2 OU2 OUi 

are satisfied owing to (2.94a). 

In the U 1 U 2 plane, the curves corresponding to X\ = const., X 2 = const, 
are two families of mutually orthogonal hyperbolas; moreover, if we consider 
the portion of the U 1 U 2 plane bounded by two hyperbolas of a family, it is 
easy to see that it is transformed in an infinite strip parallel to one of the 
axes in the X 1 X 2 plane. From (2.94a) and (2.94b), |r| = |ttp, that is, the 
transformation of the U 1 U 2 plane into X\X 2 “squares” the distances from the 
origin; as regards the polar angles, they are doubled. In fact, if is the 
polar angle in the X 1 X 2 plane and '0 the corresponding angle in the U 1 U 2 
plane, then 



tan 'd = — 

Xi 



2u\U2 2 U 2 IU 1 

ul -ul 1 -{U2/U1Y 



2 tan0 
1 — tan^ 0 



tan 20. 



Then, to obtain the image of a point, one has to square its distance from 
the origin and to double its polar angle. For a closed curve described by a 
point moving in the parametric plane, there will correspondingly be a curve 
described twice in the physical plane. Inversely, to a complete cycle in the 
physical plane there will correspond a half-cycle in the parametric plane. On 
the other hand (2.94b) is obviously not one to one. To a point Wo one and 
only one point zq = iCg corresponds, while to a point Zq two values of w 
correspond: w — w = 

It is convenient now, also in view of future extensions, to rewrite the Levi- 
Civita transformation in matrix form. By representing r and u by means of 
column vectors, (2.94a) becomes 



Xi 

X2 



U\ 


-U2 \ 


! ui 


U2 


Ui ) 


V'^2 



which can be written as 

r — L{u) u. 

One can also immediately verify that 



r' = 2 L{u) u' . 



Moreover the matrix L{u) has the following properties: 

a) £^(m) i(«) = Itipl, L~^ (u) = L'^ (u) 

\u\ 



(2.96a) 

(2.96b) 

(2.97) 



^^Obviously, only at the points at which dz/dw 7^ 0. 

^“^At the origin dzjdw — 0; therefore there the transformation is not conformal. 
^^See Stiefel-Scheifele: op. cit. 
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Fig. 2,8 



(where 1 is the identity matrix), and then it is an orthogonal matrix. There- 
fore one can solve (2.97) and get 

b) The elements of L (u) are linear and homogeneous functions of the Ui, 
and then 

L {u)' = L {u'). 

c) The first column L {u) is given by the vector u itself. 



Putting off discussion of how the L matrix can be extended to three- 
dimensional space, let us again turn to our problem, that is, the regularization 
of (2.92). By differentiating (2.97) and substituting into (2.92), one finally 
obtains 

^ _ //li' . \ 

= 0. (2.98a) 



u" + 



{u • u) 



By squaring (2.97) and substituting into (2.93), one also has 

1 



f - i^f 



h. 



It • 



Consequently (2.98a) becomes 



u" — hu = 0. 

2 

For Keplerian elliptic motion, < 0, the components will be 






'^2 + \ \h\u2 = 0 , 



(2.98b) 

(2.99) 



that is, the equations of motion of a planar oscillator in the parametric UiU 2 
plane. To study the peculiarities of the Levi-Civita transformation, we shall 
confine ourselves to considering the case where (2.99) corresponds to an el- 
liptic orbit, that is, the two oscillations have a phase difference of tt/2: 



U 2 = 13 sino;5, (2.100a) 



ui = a coscjs. 
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with ijo — ^J\h\/2. From the definition of the eccentric anomaly, we have 
uj s — u/2, and (2.100a) become 



U o • ^ 

Ui = a cos — , U 2 — fj sm — . 



(2.100b) 



Let us assume that ellipse (2.100a) has its minor axis on the u\ axis (/? > a). 
From (2.94a,b) 



2 2 ^ /o2 • 2 

xi = a cos — — p sm 



. 2 ^ 



( 2 . 101 ) 



X 2 = OL sini^. 



Moreover 



l~2 2 2 2 or — + [ 3 ^ 

r = y xf + X 2 = Ui U 2 = cos u H . 

Equations (2.101) are the parametric equations of an ellipse in the x±X 2 plane. 
It has its centre at the point 






X2 = 0 , 



and the semiaxes are respectively a — (a^ + /^^)/2, b — a(3 (see Figs. 2.9 
and 2.10, which are not, obviously, at the same scale). 

Also 2ae = j3‘^ ~ and then the expression for r becomes 

^ 2_^2 

r = ^ ^ ^ cos u — a ( \ — e cos u ) , 

which is again (2.50). 





168 2. The Two-Body Problem 



Regularization by Means of Conformal Transformations 
in the Lagrangian Formalism 



Besides (2.94), a certain number of conformal transformations have been 
set up to regularize the equations of motion. The search for these transfor- 
mations has been particularly sedulous in the case of the restricted three- 
body problem, which will be discussed in Chap. 4. However, we now want to 
show how, in the Lagrangian formalism, it is possible to state the problem 
in a completely general manner for systems with two degrees of freedom. 
We shall confine ourselves to natural systems, deferring to Chap. 4 the case 
where the kinetic energy contains terms linear in the velocities. Therefore, 
consider a conservative system described by means of the Lagrangian co- 
ordinates = xi, Q 2 — subjected to forces deriving from a potential 
U = U{xi, X 2 ). Thus we shall have the Lagrange equations 



^ (—'] - — -0 — - — 

dt\dxi) dxi ’ dt\dx2j 8x2 



( 2 . 102 ) 



with C — T+U and h — T—U — constant. Now, let us consider the complex 
variable z = X\-^ix 2 and the transformation 2 : = f{w ) , with w = U\-]-iu 2 . 
If the Cauchy-Riemann relations (2.95) are valid, then the transformation 
2 ; = /(rc) is a conformal transformation at all the points where /' (re) 0. If 
we assume U\^U 2 to be new Lagrangian coordinates, owing to the invariance 
of the Lagrange equations under a transformation of variables, we can write 
the new system as 



d 

dt 



dC 

dill 



uUi 



A ^ - — - 0 

dt \ dii2 ) du2 



(2.103) 



where 



dXi{ui,U2) 


2 

-1- 


dX2{ui,U2) 


dt 




dt 



+ U[xi{ui,U2),X2(ui,U2)]. 



If we call J the Jacobian determinant of the transformation 




because of (2.95). Therefore, 

C = ^J {ill + iil)+ u. 



By substituting (2.105) into (2.103), we obtain 



(2.105) 
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d , ^ ^ dU 1 dJ , 2 . 2\ n 

d dU 1 dJ ..2 ■2\ 

If, finally, we transform the independent variable by putting 

dt = J dr^ 



(2.106) 



we have 



<Pui ,d{u + h) , w 



d^ 



U2 



dr‘^ 



OU2 OU2 



taking into account that /i is a constant and T =U h. Hence 

U2 d 



d‘^ui d , ^ . .. 



dr^ du[ 



■[J{U + h)]. 



(2.107a) 



Clearly, if the transformation we are considering is such that JU — const., 
the equations of motion become 



d2 



Ui 



dr‘^ 






cP 



U2 



dr‘^ 



^{Jh) = h^. (2.107b) 

OU2 OU2 



In addition, whether (2.107b) are regularized or not will depend on the form 
of J (as a function of ^ 1 , 1 ^ 2 ). In the case of Keplerian motion, as in (2.94a), 
(2.94b), we have 



J = 4 {ul + ul) , U 









,2 • 



Therefore JU is constant and (2.107) give 

^{u\+ul) 



dJ u\ d 

dr‘^ dui 

(Pu2_ d 

dr‘^ du2 



^{u\ + ul) 



= 8 /iiii, 

= Shu2- 



Denoting with the prime derivation with respect to r, for ft, < 0, these become 
u'l + 8 |ft| ui = 0, u'^ + 8 |ft| U2 = 0, (2.108) 



that is, the equations of motion of the planar oscillator with frequency uj = 
a/ 8 |ft|. The difference, by a factor of 16, with respect to (2.98) is due to the 
fact that the two fictitious times s and r differ by a factor of 4. 

It is interesting now to look at the transformations undergone by the first 
integrals. Let us begin with the energy integral, in the transformation from 
the variables xi,X 2 ,^ to the variables ui^U 2 jT. Then 
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^J{ul + iil)-U = h. 



and, multiplying this by J, we get 



\ («1 +uj) = ^ 



dui'S.'^ / du2\^ 

dr J \ dr J 



J{U + h). 



When the transformation is such that JU = const. = then T — J h — 
where 



T 






(2.109) 



is the “new” kinetic energy and —hJ = V the “new” potential energy. In 
the Kepler case seen above 



1 

2 




du2 

dr 



2 -| 

+ 4\h\{ul + U 2 ) = 4/i; 



therefore the total energy of the oscillator is given by 4/i. 

In the case of central potentials U{xi,X 2 ) = U{y/xi + x^) = U{r), one 
also has conservation of angular momentum, equation (2.87), where d is the 
polar angle measured from the positive Xi axis. Recall that under a conformal 
transformation 2 : = /(^), every linear size is subject to a magnification by 
a factor \dz/dw\~^ . The angles obviously remain unchanged (the magnitude 
as well as the sense of rotation are preserved), except for the points where 
dz/dw — 0. Let us suppose that at the point a one has {dz/dw)\w^a = 0 
and the derivative has a “zero” of order a (when a function f{w) has the 
expansion 

00 

/H = 

n=0 

about any point w = a, one says that f{w) has a zero of order a in a when 
Jq = , fc-i = 0,/c 0); in this case the angle d is transformed 

into the angle ip = ad. If we now consider the planes X\X 2 and U\U 2 and the 
angles dd and dp with the vertices at their respective origins, the relevant 
elementary areas dS and dS will be given by 2dS = 2dS = R^dp., 

where R? = u\^U 2 ‘ Because of what was said above. 



r^dd — 



dz 

dw 



^ 2 ^ _ j^2 



a 



a 



(2.110) 



If now c = |c| = r‘^d and C = \C\ — R?p' are the angular momenta in the 
variables x\^X 2 ,t and ui,U 2 ^r respectively (recall that dt = Jdr), then 

cdt = r^dd = JR^^—dr = —Cdr, 
dr a a 



and thus 
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C 

c = — . (2.111) 

a 

Obviously, if at the origin dzjdw 0, then a = 1 and c = C. In the case of 
the Kepler problem, z = and then a = 2 , and consequently C = 2 c. 



2.7 Topology of the Two-Body Problem 

We have seen in Sect. 2.5 that the Delaunay elements for Keplerian motion 
represent a set of action-angle variables. This entails, as we have seen in 
Sect. 1.17, that the phase curves wind round a two-dimensional torus; more- 
over, as the motion is a periodic motion, these curves are closed: projection 
on the plane of motion yields for each of them an ellipse (we are dealing only 
with the case h < 0). It is therefore evident that the topological structure of 
the manifold round which the phase curves of the Kepler problem are wound 
is that of a two-dimensional torus. In fact, the topological properties of a 
manifold are those properties which are invariant under homeomorphisms^ 
that is, under transformations which are one to one and bicontinuous: this 
is the case here. However, we want to reobtain this result in a more direct 
way, without passing through the action-angle variables, but looking, from 
the beginning, at the geometric aspect of the problem. 

Let the XiX 2 plane be the plane of motion and consider a unit mass; 
moreover, for the sake of simplicity, we choose a system of units in which 
/i = 1. If we put Xi — xs, X 2 — X 4 , the system of equations will be: 

Xi=Xs, ±2=X4, ^3 X4 = -^, r = ^Jx\ (2.112) 

The phase space consists of the four-dimensional space X2 X3 X4 ; in this 
space, deprived of the plane = 0, X2 = 0 , (2.112) define a nonsingular 
family of phase curves. For system (2.112), the first integrals of energy and 
areas are 

2 

+ = - + 2h, (2.113a) 

X 1 X 4 - X 2 X 3 = c. (2.114a) 

Equations (2.113a) and (2.114a) each define a three-dimensional manifold: 
the integral manifold of the energy (which will be called E) and the integral 
manifold of the areas (which will be called A). Since (2.113a) and (2.114a) 
are independent first integrals of system (2.112), if we simultaneously fix the 
constants h and c, we come to define a new manifold (intersection of E and 
A), which we shall call EA, having dimension two, on which the phase curves 
of the system will develop in full. 

Equations (2.112), as we know, in addition have another first integral, 
corresponding to the existence of the Laplace-Runge-Lenz vector, which we 
shall write in the most general form: 
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arccos arccos 

r 



-r + (XiX4 - X2 XsY 



-J (X1X4 - X2Xs)^ {xl + - ^) + 1 



= u. 



(2.115) 



By introducing the polar coordinates r and 'd in the X\X 2 plane and fixing 
the constants h and c, (2.115) becomes 



'd — uo — arccos 



— r + 

r\/2hc^ T 1. ’ 



from which 



1 + e cos (i? — cj) ’ 



(2.116a) 



where = 1 + 2/ic^. For < 0 , to have real solutions, it must be (? < 
— l/2h. In the case c = 0 , from (2.115), we have = cj + tt , that is, 
rectilinear motion. For c 0, (2.116a) guarantees that the projections of 
the phase curves in the X\X 2 plane are conic sections and, in particular, for 
/i < 0, ellipses. 

Let us now see what is the topological structure of the manifold E. We 
shall prove that E is homeomorphic with the (Euclidean) three-dimensional 
space minus a straight line. If we consider the phase space of our system, 
which is the four-dimensional space ^1X2X3 X 4 minus the points of the X3X4 
plane, and define in it the transformation 



r' = -, 'd' — 'd, Xq = X 3 , X 4 = X 4 , (2.117) 

r 

this transformation is a homeomorphism of the four-space (minus the X3 X4 
plane) on itself. In fact the first two of equations (2.117) define an inversion 
with respect to the unit circle in the X1X2 plane, while the remaining two 
leave the coordinates X3X4 unchanged. By substituting (2.117) in (2.113a), 
we get 

x'^ +xf =2r'-h2/i. (2.113b) 

Since 1?' does not appear in (2.113b), the hypersurface having (2.113b) as its 
equation is a hypersurface of revolution. It is obtained by rotating, around 
the X3 X4 plane, the paraboloid whose equation in the three-space x'3 X4 r' is 
given by (2.113b) itself. It must be stressed that now we are considering r' 
as an orthogonal Cartesian coordinate, with the condition r' > 0, and the 
paraboloid does not intersect the line r' = 0, since < 0. As it is known that 
a paraboloid is homeomorphic with the interior of a half-plane, it follows 
that the hypersurface of revolution is homeomorphic with that obtained by 
rotating a half-plane around its boundary line. Therefore one obtains a three- 
dimensional space minus a straight line. 

Now we shall deduce, as a corollary, that A, in the case c 0, is also a 
manifold which is homeomorphic with the three-space minus a straight line. 
First of all, note that, for c y^ 0, according to (2.114a) A does not contain 




2.7 Topology of the Two-Body Problem 173 

any one of the points of xi =0, X 2 = 0 we have deleted. Furthermore, let us 
suppose c < 0; the same method can be applied for c > 0. If we now perform 
the rotations, in the planes ^1X4 and X2X3, defined by 

5x\ = x'l — x\ — X4 , 8 x 4 ^ = x'l — X4 = —x'( 

(which imply that X 4 = x'l + x'{ and xi = x'{ — x'l , from which ^1X4 = 
x'f — x'^ ) and by 

5 x 2 = x'l — X 2 = x'l, 8 xs = x'l — X3 = —x'l 

(which imply that X2 = x'l — x'l and X3 = x'l + xl , from which X 2 Xs = 
xl -x'l ),weget 

c = Xl X4 — X2 X3 = x'/ - xl — xl x'f . (2.118) 

By writing g and ^ for the polar coordinates in the xl xl plane, we can write 
(2.118) as 

x'l + X3 = -\- c. (2.119a) 

By means of the transformation x'(' = x'/, x'l' = x'l, g'" — g^, 
which is a homeomorphism of the four-space, (2.119a) becomes 

x'f ^ x'l'" = g'" + c. (2.119b) 

As c < 0, (2.119b) is analogous to (2.113b), and then one can draw 
the same conclusions: A is also homeomorphic with the three-space minus 
a straight line. For c > 0, the same method applies; moreover one can show 
that for c = 0 the manifold A (minus the points Xi = 0, X2 = 0) continues 
to have the same topological structure. 

Finally, let us move on to study the manifold EA. Suppose we have fixed 
the two constants c and h, with h < 0 , and consider the intersection EA of 
the two manifolds E and A. We shall prove that 

1) If 0 < < — ^ , the manifold EA is homeomorphic with the surface of 

a torus. 

2) If c = 0 , EA is homeomorphic with the interior of an annulus. 

3) If 7 ^A is homeomorphic with a circle. 

The equations for EA are given by system (2.113a), (2.114a); in the coordi- 
nates r', t?', X3, X4, they become the already seen (2.113b) and 

X4 cos i9' — X3 sin i9' = cr'. (2.114b) 

The latter can be written 




(2.114c) 
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which, for any fixed represents in normal form the equation of a plane 
passing through the origin of the system of coordinates X3 x'^ r' and forming 
an angle given by 

arcsin 




with the r' axis. This plane intersects the x'^x'^ plane along a line which 
makes an angle 1?' with the X3 axis. If we now make the angle vary from 
0 to 27 t , the plane starts from the position given by x'^ = c r' in the sense 
from the positive Xg axis to the positive X4 axis. Since the paraboloid given 
by (2.113b) is, in the space x'^x'^r ' , a paraboloid of revolution, the type of 
intersection with the plane will not depend on 'd' . For 'd' = 0, the intersection 
curve reduces to equation (2.113b) and to X4 = cr', whose projection on the 
plane Xg X4 is given by 




For 1 2 h > 0, the intersection is represented by an ellipse. As varies 

from 0 to 27 t, the ellipse rotates around the r' axis and at last again reaches the 
starting position. The surface generated by the rotation of the ellipse is that 
of a torus. On this torus we can make use, as coordinates, of -i?' (longitude) 
and of the angle ^ (latitude) that the vector which specifies a point on the 
ellipse makes with the X4 axis (which encloses the centre of the ellipse). 

In the case c = 0, (2.113b) and (2.114b) become (for i9' = 0) 

x'^=2r' + 2h, X4=0. (2.121) 



Instead of an ellipse, we have a parabola. As 'd' varies, the generated sur- 
face will be the topological product of a parabola and a circle. Since the 
parabola is homeomorphic with an open interval, the generated surface will 
be homeomorphic with the interior of an annulus. 

Finally, if c = ±^—l/2h , the ellipse degenerates in the point with coor- 



dinates 




As 'd' varies from 0 to 27 t, this point will describe a circle. Therefore, points 
(1), (2) and (3) are demonstrated. 

Once the nature of the manifold EA corresponding to a given value of c 
has been clarified, it remains to see how, as c varies, the manifolds EA decom- 
pose the manifold E corresponding to the value of h fixed at the beginning. 
To do this, refer again to the coordinates defined in (2.117). By keeping & 
fixed, consider the three-dimensional space r ' Xg X 4 and t he angular momen- 
tum c that varies in the interval [—^J—l/2h, +\/— l/2/i]. There will be two 
families of paraboloids corresponding, in the two half-spaces of r' Xg X4 , to 
the negative and positive values of c respectively. In Fig. 2.11 we consider 
= 0 and the paraboloids viewed from along the r' axis. If we make i?' vary. 
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the whole figure rotates around the origin, and then, as varies from 0 to 
27 t, the figure is topologically multiplied by a circle. The situation may be 
more clearly understood if first the plane of Fig. 2.11 is mapped into 

the half-plane ^77, ^ > 0 of Fig. 2.12. By rotating the families of curves in 
Fig. 2.12, the degenerate cases included, around the 77 axis, one obtains the 
families of tori, and relevant degenerate cases, which fill the three-dimensional 
space minus a straight line. The phase curves are such as to give, for any fixed 
triple /i, c, u) (with < 0, c 7^ 0), a projection on the x\ X 2 plane consisting 
of the ellipse given by (2.116a). If we perform the transformation (2.117), 
(2.116a) becomes 



1 + vTT~ 27 T? cos (7?' — Lo ) 



(2.116b) 



which is the equation of a limagon. From (2.116b), we see that r' is a one- 
valued function of 7?', and then, as the latter varies, it varies between two 
extreme values (a minimum and a maximum). Equation (2.116b) represents 
the projection of a curve which is on the torus EA given by (2.113b) and 
(2.114b). The intersection of EA with the plane d' = const, is an ellipse 
(see Fig. 2.11); to every value of 7?' , (2.116b) makes a value of r' correspond: 
then there is a point of the trajectory on every ellipse 7?' == const. As 
7?' varies, the point varies on the torus; therefore we shall have a situation of 
the type ^ = f (7?'), with / (7?' + 27t) = / (7?'), mod 27 t. The same will be 
true for all the trajectories, except for a translation of 7? (see (2.116b)). From 
(2.116b) we also see that / is a monotonic function, either strictly increasing 
or strictly decreasing, and therefore one can invert it to obtain i9' = g {^). In 
the cases c — l/2/i, EA reduces to a closed curve whose projection is a 

circle. 




Fig. 2.11 



Fig. 2.12 




Chapter 3 

The A/^-Body Problem 



The order of the chapters, and thus the framework of our treatment, reflects 
the fact that we have followed the traditional route; that is, to examine first 
the two-body problem and then the A^-body problem. In abstract terms, 
it might appear more sensible to proceed backwards, from arbitrary N to 
the particular case N — 2. The fact is that the two problems, N = 2 and 
N > 2 are qualitatively and not just quantitatively different, and therefore 
the traditional route, beside not running against the ideal logical course, 
also appears more promising from the didactical point of view. The subject 
of the present chapter is thus the problem of the motion of N pointlike 
masses, interacting among themselves through no other forces than their 
mutual gravitational attraction according to Newton’s law. 
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3.1 Equations of Motion and the Existence Theorem 

When we refer to a system of N bodies interacting through their mutual 
gravitational attraction and confined to a delimited region of space, we are 
actually speaking of a mathematical model we think appropriate to represent 
some aspects of real systems like the solar system and stellar systems (open 
and globular clusters, galaxies, clusters of galaxies, etc.). In the first instance, 
with N quite small {N ~ 10), we are in the framework of the discipline 
traditionally called celestial mechanics; in the second instance we have N > 
10^ and we can refer to the approach proper of stellar dynamics. Obviously, 
in both cases the problem is to understand if and how the mathematical 
model of N material points (that is, of bodies endowed with mass, but with 
no extension) is suitable for describing the motion of N bodies with finite 
and, in general, different sizes. 

In his mighty treatise on celestial mechanics, F. Tisserand, dividing the 
problem into two distinct sub-problems, set the question in the following way: 

. . . Considerons Tun des corps de notre systeme; nous pouvons decomposer son 
mouvement en deux autres: le mouvement de son centre de gravite et le mouvement 
du corps autour de son centre de gravite. De la les deux principaux problemes de 
la Mecanique Celeste: 

1) Determiner les mouvement s des centres de gravite des corps celestes; 

2) Determiner les mouvements des corps celestes autour de leurs centres de 
gravite.^ 

It remains now to see if the decomposition proposed by Tisserand is justified, 
that is, if the “external” and the “internal problem”^ are really separable. 
Analysis of the infiuence of the internal structure on the external problem^ 
shows that it is of the order of s o? with respect to the Newtonian attraction 
of two pointlike masses, Gw? jF?, where ol = LjR, with L the characteristic 
linear extension of the bodies, R the scale of their separation and e a number 
between 0 and 1. Since in planetary and stellar systems it is almost always the 
case that > L and, in general, e < 1 as well, we get ea‘^ <^1 and we say 
that there is an effacement of the internal structure. On these foundations 

^ Let us consider one of the bodies of our system; we can decompose its motion 
in two components: the motion of its centre of mass and the motion of the body 
about its centre of mass. From here, the two main problems of Celestial Mechanics 
follow: 1) To find the motion of the centres of mass of celestial bodies; 2) To find 
the motion of celestial bodies about their centres of mass. (F. Tisserand: Traite 
de Mecanique Celeste, Vol. I (Paris, Gauthier-Villars, 1889), pp. 51-52, English 
translation is ours). 

^ This is how Fock described Tisserand’s points (1) and (2). See, V. Fock: The 
Theory of Space-time and Gravitation (Pergamon Press, 1959). 

^ See the clear and exhaustive treatment given by T. Damour: The problem of mo- 
tion in Newtonian and Einsteinian gravity, in Three Hundred Years of Gravitation 
S. Hawking, W. Israel eds. (Cambridge University Press, 1987). 

^ This term is due to Brillouin and Levi-Civita (see the paper by T. Damour cited 
above) . 
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we will apply our mathematical model with the well-grounded feeling that it 
is suitable for the real cases mentioned above. 

We are thus given a system of N{> 2) material points Pi, P 2 , , Pn in 
motion under the action of the Newtonian gravitational force. This implies 
that we specify: 

a) the positive constants G; mi, m 2 , . . . , rnjsf] 

b) a Cartesian coordinate system, suitably chosen in three-dimensional Eu- 
clidean space; 

c) an independent variable t. 

The equations of the motion are then 

mkVk = y] [Vj - rk ) , (j + k), rjk = \rj - Vk\, (3.1a) 

j=i 

where the vector rk gives the position of the point Pk = Pk Vk^ ^k) and 
the right-hand side of the k-th equation of (3.1a) represents the total force 
acting on the A:-th point by the other — 1 material points. The coordinate 
system Oxy z is an inertial system, which we will call, for brevity, the fixed 
system, the variable t is the absolute time^ of Newtonian mechanics, and the 
positive constants mi, m 2 , . . . , misf are the masses of the material points. 
All these notions are inseparable ingredients of the Newtonian mechanics. 
Let us also assume that it is always possible to define the inertial reference 
frame uniquely. Defining the potential function 

U= Y. (3.2) 

l<j<k<N 

and the velocities Vk = Vk , we can rewrite (3.1a) in the form 

Vk=Vk, Vk = — ^ {k = l,2,...,N), (3.1b) 

rrik ovk 

Set (3.1b) is a conservative, reversible, dynamical system, with SA" degrees 
of freedom. From the point of view of the theory of systems of differential 
equations, it is a system of 6 A first-order equations. It is of the same type 
as (l.A.lb), with the vector X independent of time, and therefore it is an 
autonomous system: 

Xi = Ai {xi,X2, . . .,Xm) = = 6A). (3.3) 

If the m functions Xi are Lipschitzian in a real neighborhood of x = ^ , 
then, for given initial data x{to) = set (3.3) admits one and only one 

^ From the first Scholium of the Principia, op. cit. p. 6: “Absolute, true, and math- 
ematical time, of itself and from its own nature, flows equably without relation 
to anything external” . 
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solution. Moreover, if the functions X{ are regular analytic in the variables 
and are bounded by a positive constant M in the domain 
\^i ~ ^i\ < — 1,2, then the solution x (t) of set (3.3), which 

satisfies the initial condition a?(to) = is a regular analytic function of t in 
the complex neighbourhood 



\t - to\ < 



r 

(m + 1) M 



and satisfies 

\xi ~^i\ < r, i = 1,2, ...,m. 

This statement of the existence and uniqueness (Cauchy) theorem comes from 
Siegel and Moser, ^ to which we refer the reader for the proof. 

We will also follow Siegel and Moser for the application of the Cauchy 
theorem to set (3.1b) and will choose units so that G = 1 in (3.2). The initial 
time, rk{to) and Vk{to) are real but, to find the positive constants r and 
M, we will assume that the values of these variables are complex. Suppose 
that at the starting time to we have rjk{to) >0 {j ^ k) and let A be an 
upper bound of U at the time t = to- 



Uto < A. 



(3.4) 



Let us make the definitions 



g = mmj^krjk{to), /x = min/e (3.5) 

From (3.2), we have directly that 

— <Ut,< A, 

from which 

^ (3.6) 

We now want to find an upper bound for 



dU 




du 




dU 


dxk 


5 


dyk 


1 


dzk 



with k = 1,2, . . . , N and = (xk, Vk-, ^k)- To this end, we look for a lower 
bound of Tjk , for complex Vk near to rk{to)- Letting 



X =[xk- Xfc(io)] - [Xj - Xj{to )] , 
y = [yk- Vk (^o)] - [vj - Vj (to)] , 
Z =[zk- Zk (io)] - [Zj - Zj (io)] , 



^ C. L. Siegel, J. K. Moser: Lectures on Celestial Mechanics (Springer, 1971) Sects. 
4, 5. 
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we also have 

Xk - Xj = X + [ajfc(io) - Xj{to )] , 

Vk-Vj = Y + [yk {to) - Vj (io)] , 

Zk- Zj = Z + [Zk {to) - Zj (io)] • 

For rjf. we have 

r% = {xk - Xjf + {vk - + {zk - Zj)^ 

= r^k (^o) + +Y^ + Z“^) 

+ 2X [xk{to) - Xj{to)\ + 2Y [yk{to) ~ yj{to)] + 2Z [zk{to) - Zj{to)] . 

Using the Schwarz inequality to majorize the sum of the last three terms and 
substituting the second term with the sum of the squared moduli, we finally 
get 

r% > r%{to) - 2rjk{to)V\W+WW+W ~ + \Z\^) • (3.7) 

Then, if we impose the condition that 

\xi - Xi{to)\, \Vi - yi{td)\A^i - ^i{to)\ < Vz = l,2,...,V (3.8) 

SO that 

1^1, i>^i, m < I . 

|X|2 + |y|2 + |^|2 < ^^2 < 

inequality (3.7) becomes 

r% > r%{to) - 2rjk{to) | “ fg ^ ^Jfe(^o) > J r%{to), (3.9) 

as a result of the first of equations (3.5). Finally 

(3.10) 

To satisfy inequality (3.8), taking into account (3.6), we require that 

u? 

\xi-Xi{to)l \yi-yi{to)l \zi-Zi{to)\ < Vi = 1, 2, . . . , TV. (3.11) 

From (3.8) we also have 

\xj -Xk\ = \ [xj - Xj{to)] + Xj{to) - [Xk - Xk{to)] - Xk{to)\ 

< \xj - Xj{to)\ + \Xk - Xk{to)\ + \Xj{to) - Xk{to)\ 

1 / X 8 , , 

< -jQ-^rjk{to) < i^rjkito) 

and analogously 

g 

\yj-yk\, \zj-Zk\ <-rjk{to), Vj,k=l,2,...,N. 
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From these and (3.6) and (3.10), it follows that 

\xj-Xk\ ( 2 y 8 ^ 1 . 64 

~ \rjk{to)) ° 7 [rjfc(fo)]2 “ 7^^ 

and analogously 



-yfcl 



\zj - Zk\ ^ 64 
rjk^ ~ 7 ijl* ' 



As a consequence 



dU 




""" dxk 





E 

j#fc j'' 



< ciA^ 



and 



E (2/i - vk) 

j^k 



E 



m 



^jk 



^ - Zk) 



< ciA^ 



where the constant Ci depends only on the masses. Denoting by v one of the 
3N components of the velocity xi, . . . , xat, • • • , yi, . . • , iiv, we find that the 
energy integral (the system is conservative) gives 



1 (to) < Tto ^ Uto + h < A + h, 

where T is the total kinetic energy, h the total energy constant and m the 
mass of the body having one of the components of its velocity equal to v. 
Therefore 

1^ (^o)l < C 2 VA-\-h, 

with C 2 another constant depending only on the masses. Now writing 



14 A 



= r 



(3.12) 



and requiring, together with (3.11), that the inequality \v — v {to)\ < r is 
satisfied, we finally obtain 

I'^l < 1'^ - (^o)l + 1^ (^o)l < r + C 2 V A h. 



We can conclude, therefore, that, if the constants r and M in the statement 
of Cauchy’s theorem are given by (3.12) and by 



M — C\ + 14A \/A + /i, (3.13) 

in such a way that the functions on the right hand side of the set (3.1b) are 
regular and bounded in absolute value in the domains 

\xi - Xi{to)\, \Vi - yi{to)\, \zi - Zi{to)\ < r, 

\xi - Xi{to)\, \Vi - yi{to)\^ l^i - Zi{to)\ < r. 
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then the solutions {t) and Vk{t) , corresponding to the initial conditions 
rk{to) and Vk{to ) , are regular analytical functions of t in the complex neigh- 
bourhood: 

V 

^ ^ {6N + 1)M "" 



In particular this will be true in the interval 



^0 ^ ^ ^ ^0 H ” 



where S depends only on A, on h and on the masses. 

It is possible to prove^ that, in the case in which in place of system (3.1b) 
we have the Hamiltonian equations 

oqj f)qj 

P^ = -T- (^ = l,2,...,3iV), (3.14) 

dpi dqi 

with H = T — U ^ with the bounding conditions 

\n{p,q)\<M, 

h - qi(to)\ <2g, 

\Pi-Pi{to)\ <2g, 

we find that the solution is regular for 

I I 

^ {6N+1)M 

and that it satisfies 

hit) - qi{to)\ < g, 

\Pi{t) - pi{to)\ < g. 

Returning to the solution of set (3.1b), we see that, up to t — to < S, 
the solution Vk {t) remains regular as a function of t and, in particular, 
there can be no collisions. In fact it is obvious that, in the case of a collision 
between the A;-th and the i-th point, it should be the case that rki = 0 with a 
corresponding singularity in the potential and, with h remaining constant, at 
least one of the components of the velocities would become infinite and this 
clearly would be in contrast with the asserted regularity of the solution. If 
we analytically prolong the solution for t > to happen that all the N 

vectors Vk stay regular for every finite t > to or that a ti > is passed such 
that there at least one Vk {t) becomes singular: in such a case, for t > t\ 
the potential U tends to infinity. If this were not the case, in fact, we should 
have 

Ut, < A, 

and, again applying the existence theorem with ti in place of to^ we would 
find that the solution is regular, contrary to the hypothesis. This shows that 

C.L. Siegel, J.K. Moser: op. cit., Sect. 4. 



7 
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lim U = oo, 

t — yti 

that is to say, for the definition of [/, the least of the n (n — l)/2 distances 
Vjk {j ^ k) must tend to zero when t to. 



3.2 The Integrals of the Motion 

From what we have said in the preceding section, it appears that the solution 
of the A/'-body problem depends on: 

1) the condition of applicability of Cauchy’s theorem (only if these conditions 
are fulfilled does the theorem insure the existence of the solution); 

2) the actual construction of the solution, i.e., the integration of (3.1a) or 
(S.l.b). 

Historically, the second point, was the first faced by eighteenth-century and 
later mathematicians. 

The first milestone was established by Lagrange in a celebrated paper on 
the three-body problem.® We should first remark that the number of first 
integrals whose existence can be demonstrated does not depend on N but on 
the nature of forces; therefore the results obtained for N = 3 are valid for 
any N. That work concentrated on the three-body problem was due both to 
the fact that it was the easiest approach to the problem (for N > 2) and 
to the possibility of concrete application to the solar system. The system of 
differential equations obtained by Lagrange was of order seven and did not 
explicitly contain the time: it could already have been concluded that, by 
applying the procedure of eliminating time (see Sect. 1.11), the final system 
was of order six. In other words, the order of the initial system can be lowered 
by 12. This result, although it does not appear explicitly in Lagrange’s paper 
as far as the latter operation is concerned, was never improved upon in the 
next two centuries. By this we mean that the order of the system to be 
integrated has remained 6A^ — 12: over two hundred years later, the best 
efforts of the most eminent mathematicians have not modified this result. As 
we shall see, at the end of the last century, Bruns demonstrated, in fact, that 
this is the maximum number of integrals which can be obtained. 

In this section, we shall therefore try to demonstrate how the first integrals 
of the A^-body problem may be obtained and we shall also discuss their nature 
and meaning from different points of view, while in the next section we shall 
deal with the singularities, that is, when the Cauchy theorem cannot be 
applied and consequently the existence of the solution cannot be assured. We 

® J. L. Lagrange: Essai sur le probleme des trois corps [Recueil des pieces qui 
ont remporte le prix de I’Academie Royale des Sciences de Paris, tome IX, 
1772], reprinted in J. L. Lagrange: Oeuvres^ Vol. VI (Gauthier- Villars, Paris, 1873), 
pp. 229-324. 
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shall start by considering the problem in the Newtonian formalism; we will 
then look for the integrals in a direct way (see the remark at the beginning 
of Sect. 2.2). 

Let us begin with system (3.1a); summing the right-hand sides over /c , 
we obtain 












which turns out to be equal to zero, because in the sum to each term (r^ — r^) 
there will correspond a term (r^ — r^) that is equal and of opposite sign. That 
is, 

'^rukrk^O. (3.15) 

k 



This relation can be interpreted in the following way. Define the vector 






1 

M 



X] l^kVk, 

k 



(3.16) 



with M = '^k , which identifies the centre of mass of the system; then 

(3.15) is equivalent to 

rc.m. =0. (3.17) 

This can be immediately integrated giving 

r’c.m. = at -h 6, (3.18) 



where a and h are constant vectors that can be expressed as a function of the 
initial data: relation (3.18) then provides six first integrals. From a physical 
point of view, (3.18) expresses the conservation of the total momentum of 
the system. The centre of mass of the system moves with uniform rectilinear 
motion. The centre of mass can therefore be taken as the origin of a new 
inertial reference system with axes parallel with those of the previous fixed 
system, and thus the motion of the N bodies can be referred to their own 
centre of mass. 

The equations of motion do not change, since rc.m. = 0, and therefore we 
shall keep using the same notations, bearing in mind, however, that between 
Vk and Vk 

= 0 (3.19) 

k 

(the origin of the reference system is at the centre of mass) and 

mfe -Wfe = 0 (3.20) 

k 

(the total momentum referred to the centre of mass is equal to zero). These 
correspond to six scalar relations and as a consequence the order of the system 
is reduced to 6N — 6. 
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Let us now consider the system in the form (3.1b) and multiply scalarly 
for ffc; by summing over k one gets 

^ dU drk 
^ dvk dt 

k 



^ ruk rk'T 



from which 



A 

dt 



1 

2 



^ rrik Vk^ 



k 



~di' 



and then, since the quantity in square brackets is the total kinetic energy of 
the system T, 

T=U + h, (3.21) 



where h is constant and equal to the total energy. By applying the energy 
integral, the order is reduced to 6A^ — 7. 

Starting from (3.21), we can obtain an important relation known as the 
Lagrange- Jacoby identity.^ Defining the moment of inertia of the system as 



I = '^mkrk‘^ rk-Vk, 

k k 



(3.22) 



and differentiating twice, we have 



i = 2'Y^mk {Vk-Vk) + 2 



E 



Vk-ruk Vk 



4T + 2^ 

k 



dU 

OVk 



As is a homogeneous function of degree —1 in the variables {xk^yk, Zk) = 
rk , Euler’s theorem enables us to write 



TT 



Therefore I = AT — 2U = 2U + 4 A, so that 

l/ = f7 + 2/i 



(3.23) 



(Lagrange- Jacobi identity). 

We will now obtain a further reduction of the order of the system by 
3, taking into account angular momentum conservation. Let us vectorially 
multiply system (3.1a) by and sum over k: 



LI (rkXTk) = L 

k k 






G ruj ruk 



{vj X r*) . 



® This equation was obtained by Lagrange for the three-body problem in the paper 
mentioned in Footnote 8 and generalized by Jacobi (see the fourth lecture of 
Jacobi: op. cit. p. 22) 
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The sum over k to the right gives a null result, because of the antisymmetry 
of the vector product and, for the same reason, we have for the left-hand side 



That is. 



dt 



y] ruk {rkX-Vk) 

• k 



= 0 . 



^ rrik (rkXVk) = c, (3.24a) 

k 



where c is a constant vector that represents the total angular momentum of 
the system. Equation (3.24a) corresponds to the three scalar relations 



E (yk Zk - zkVk) = Cl, 
k 

TUk {Zk ik - Xk Zk) = C 2 , 
k 

E Vk “ Vk Xk) = C3, 

k 



(3.24b) 



and c = (ci , C 2 , C 3 ) . Hence the order of the system is reduced to 6 10. The 

ten constants of the motion ai,a 2 ,a 3 , 61 , 62,^35 ci,C 2 ,C 3 , h are algebraic 
functions of the coordinates and velocities of the N mass points and of the 
time and one can demonstrate that they are independent of each other. 

As we have already said, in addition Bruns^^ demonstrated that further 
independent algebraic integrals do not exist. Later on, Painleve^^ was able 
to extend the demonstration to the case where the algebraic dependence is 
required only for the dependence on the velocities, while the dependence 
on the coordinates can be anything. Furthermore, it has been demonstrated 
that this result also applies when integrals are substituted for the algebraic 
functions, that is Abelian functions can also be included. Bruns’s theorem, 
although it ended the search for integrals of the type mentioned above, did not 
stop the question being asked. As well as the tendency to interpret Bruns’s 
result as a proof of the nonexistence, tout court, of integrals in addition to 
the known ones, we also mention the position of those who considered it 
irrelevant from the point of view of dynamical systems. As an example, here 
is Wintner’s conclusions^ 

Bruns: Uber die Integrale des Vielkorper-Problems, Acta Math.^ 11, 25-96 
(1887). The demonstration of Bruns’s theorem can also be found, for instance, in 
Whittaker: op. cit. pp. 358-379. 

^^P. Painleve: Memoire sur les integrales premieres du probleme des N corps. Bull 
Astr., 15, 81-113 (1898). 

^^A. Wintner: The Analytical Foundations of Celestial Mechanics (Princeton Uni- 
versity Press, 1947), p. 97 (the notations in the text have been adapted according 
to our Sect. 3.1). 
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. . it must, however, be said that the elegant negative results of this arithmetical 
type do not have any dynamical significance. For all that is of dynamical interest 
is an enumeration of all those independent integrals F (x) which are isolating. 
Now, even if X{x) is algebraic, the algebraic character of an integral F (x) of 
(3.3), though sufficient, is by no means necessary for an F {x) which is an isolating 
integral.” 

The integrals of dynamical interest, Wintner maintains, are those which have 
the property of being isolating^ i.e., those which enable one to make predic- 
tions concerning the possible future (or past) positions of the solution path 
which goes, for instance, at t = to ^ through x = Xq. 

A more careful characterization of the nature of isolating integrals will 
be given in Sect. 5.1, to which we refer the reader. Taking up the thread of 
our argument again, we remark that it is possible to obtain a further low- 
ering of two in the order 6N and to reach the order 6 A' — 12, by applying 
two procedures, the first being the already mentioned (Sect. 1.11) elimina- 
tion of time, and the second, due to Jacobi, the elimination of nodes. The 
procedure of eliminating nodes was introduced by Jacobi for the three-body 
problem and later on extended by other people to the case of any N; it has 
been reformulated in various ways, and a satisfactory explanation would take 
up too much space. We limit ourselves to taking it up as follows, using the 
words out of Whittaker’s book (p.341): when one of the coordinates which 
define the position of the system is taken to be the azimuth 0 of one of 
the bodies with respect to some fixed axis (say the axis of z), and the other 
coordinates define the position of the system relative to the plane having this 
azimuth, the coordinate (j) is an ignorable coordinate, and consequently the 
corresponding integral (which is one of the integrals of angular momentum) 
can be used to depress the order of the system by two units (with a net profit 
of one). We shall again find this result later on when determining what are 
the integrals in involution admitted by the A-body problem. 

Let us now tackle the problem from a Lagrangian point of view, by apply- 
ing Noether’s theorem, as we have already done in the case of the two-body 
problem in Sect. 2.2. In that case, having simply considered the system in the 
plane of motion, the transformations involved were dependent on only one 
parameter. Now, on the other hand, since we are obliged to consider the A- 
body system in three-dimensional space, this will no longer be possible; it is 
enough, in fact, to think about the example of rotations: in three-dimensional 
space, a rotation will, in general, depend on three parameters. So it will be 
better to write (1.B.35) and (1.B.37) in a more general form, making use of a 
parameter e = (ei, £ 2 , • • • , ^r)i with r < n. Thus, we will replace (l.B.37a) 

G. J. Jacobi: Sur relimination des noeuds dans le probleme des trois corps, 
Crelle’s Journal f.r. und ang. Mathematik Bd., XXVI, 115-131 (1843). 

^^The first application of Noether’s theorem to the A-body problem goes back 
to Bessel-Hagen. See: E. Bessel-Hagen: Uber die Erhaltungssatze der Electrody- 
namik. Math. Ann. ^ 84, 258-276 (1921). 
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with 

t = t + £sTs{t,q,q)->ro{e), (ik = <lk + [t,q,q) + o{e), (3.25) 

where 5 = jej, 5 = 1, 2, . . . , r < n, /c = 1, 2, . . . , n. It is easy to check 

that the Killing equations will now govern the generators and and 
that (1.B.48), which defines the first integrals, will become 

d ^ 

Is = Cts + — (Cs - QiTs) - fs, s=l, 2, ...,r (r < n). (3.26) 

As the Lagrangian of the system, we define 

1 ^ 

C = T U = - rrik + t/, (3.27) 

^ k=i 

where U is given by (3.2) and the Cartesian coordinates will be taken as 
Lagrangian coordinates according to the rule 



xi = qi, yi = Q2, zi = qs] X 2 = ^4, • • • , = Qn-i, zn = qn, n = 3N. 

In the same way, the conjugate momenta pi = dCjdqi are 



=mixi, p2 = mi^i, ... , pn-i =rriNyN, Pn = mNZN^ 

Let us now proceed to verify with what types of symmetry and quasi- 
symmetry our system is endowed. We begin with time translations. If t\ = 
— 1, /i = 0, (^1=0, Vi , then the integral 

o n 

h = ^q^-C = r-U = h (3.28) 

dqi 

exists end represents the total energy. Then define the space translations: 



i = t, Xk-^Xk+S2, Vk^yk+^3, Zk=Zk^S4, 

with Tg = 0, = 1, 5 = 2, 3, 4, Vi = 1, 2, . . . , n. The corresponding Killing 

equations are satisfied by / 2 , /s, / 4 , = 0 . For the sake of brevity, we omit, 
here and in what follows, the details of the calculations. Equations (3.26) give 

I 2 = ruk Xk = const., 
k 

Is = TUk ijk = const., 29 j 

k 

I 4 = ^ ruk Zk = const., 
k 

that is, the conservation of the total momentum. Integrals of the type (3.29) 
could not be present in the case of the two-body problem, since we studied 
it in the field of a fixed centre and so precluded a priori invariance under 
translations. 
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Let us now study the behaviour of the system under spatial rotations. By 
considering a spatial rotation as the result of three rotations in the planes 
xy, xz, yz, and denoting by £5, Sq, 67 respectively the corresponding pa- 
rameters, we have 

Xk=Xk- e^yk 

y*. = j/fc + £5 Xk -erZk, (3.30) 

Zk = Zk- eeXk + e7Vk- 

The generators will then be given by Ts = 0, for s = 5, 6, 7, and 

( -Vk, (Xk, rO, s = 5, 

= \ Zk, ef = \ 0, c" = -Xk, S - 6, (3.31) 

lo, [-Zk [yk, S = 7, 



where we have labelled with suffixes x, y, z the three groups of generators 
corresponding to the transformations of the coordinates Zk (with 

/c = 1, 2, . . . , A^) respectively. Also in this case, it is possible to check that 
the Killing equations are satisfied by /s = 0 (5 = 5,6,7). Equations (3.26) 
give 



^fxk.^ fryk , ^ czk 

dxk^^ dvk^^ dzk^^ 



= Is 



= const. 



(s = 5,6, 7). 



By inserting (3.31), one obtains 



h='Y^fnk {xk Vk - VkXk) = C3, 
k 

h = '^rnk {zk Xk - Xkh) = C2, (3 32) 

k 

I7 = '^mk ivk Zk - ZkVk) = Cl, 
k 

that is, the components of the angular momentum. 

Lastly, let us consider a final type of transformation. Since the motion 
of our system has been referred to an inertial reference system, one expects 
invariant behaviour under a transformation that is equivalent to passing to 
another inertial system. The transformation will be of the type 

t = t, Xk = Xk + es, t, Vk=yk + £9 1, Zk = Zk + £10 1, (3.33) 



which represents a translatory motion at constant velocity of components 
£s, £9, siQ. The corresponding generators will be Ts = 0, for 5 = 8,9, 10 and 






xk 



S ^yk 

0, s = 9,10, 






9, 



0, s = 8,10, 






zk 



do', 



s = 10 
s = 8,9. 



(3.34) 



Unlike the transformations considered up to now, in (3.34) we have generators 
depending on time, and so the Killing equations will also contain partial time 
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derivatives, both of generators and of the functions fg. They turn out to be 
satisfied by (3.34) if 

fg = '^mk Xk, f9 = '^mk Vk, fio = Y^mk Zk; (3.35) 

k k k 

hence the case in hand is a case of quasi-symmetry. The corresponding first 
integrals are 

^8 = i ^ m/c ifc - ^ mfc Xk, 
k k 

I 9 = t rriki/k - ruk Vk, (3.36) 

k k 

ho^t'Y^rrikZk-'^ rrik Zk- 

k k 

As for their physical meaning, if we compare (3.36) with (3.16), (3.18), we 
immediately get 

M = mkXk, Ma.y^'^mkVk, M&^ = y^ mkZk 

k k k 



and 



M Sixt - rukXk = -Mhx = /s, 

k 

M Siyt - ^ rrikXk = -Mhy = /g, 
k 

Mdi^t-Y^mkXk = -Mh^ = /lo; 

k 



(3.37) 



therefore Is, /g, iio are the first integrals connected with the uniformity of 
the centre of mass motion. Furthermore, it is possible to demonstrate that the 
set of transformations we have considered constitutes a group, a 10-parameter 
group, usually named the Galilei group. 

If we now consider the Hamiltonian of the system, T-L — T — U = h, and 
evaluate the Poisson bracket. 



(I dh dis dh 

^Ydxkd{mkXk) d{rrikXk)dxk 



k 


\dh 


dh 


dh 


dh] 


dpk dimkVk) 


dimkVk) dyk_ 


k 


'dh 


dh 


dh 


dh] 


dzk d{mkZk) 


d{rrikh) dzk\ 



and analogously ( 7 g, 7 i) and (7io, 7 i), it is easy to verify that they are differ- 
ent from zero. Therefore the integrals 7 g, 7 g, 7io are not in involution with 
the Hamiltonian. In the same way, one can easily verify that 7 g, 7 g, 7io are 
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in involution with each other. Passing to the components of the angular mo- 
mentum, we find that each of them is in involution with = cf+cl+cl; if we 
pick out one of them, we get, together with c^, a pair of independent integrals 
in involution. If now, at the end, we evaluate (/g, c^), (/g, c^), (/lo, c^), we 
get a result different from zero. Nevertheless the three brackets will vanish if 
we fix the origin of the coordinate system at the centre of mass (in this case 
Ig = Iq — /^Q = 0). Therefore, by putting the origin at the centre of mass, 
our system gets five integrals in involution. This is equivalent, as we know 
from Lie’s theorem (see Sect. 1.18), to ten first integrals. If we add to these 
ten integrals the energy integral and eliminate the time, again we reduce the 
order of the system to 6N — 12. To sum up: the possibility, implicitly con- 
tained in Lagrange’s paper, of reducing the order of the system to 6N — 12 
has been checked by different methods by a number of mathematicians over 
more than two centuries. In particular, the most recent reductions (Poincare, 
Whittaker, Levi-Civita) leave the canonical form of the equations of motion 
invariant. 



3.3 The Singularities 

We saw in Sect. 3.1 that, if the initial conditions are such that as defined 
in (3.5), is positive, then, for Cauchy’s existence theorem, there exists a real 
analytical solution for system (3.1b) in an open time interval \t — to\ < S. 
This local result can be extended, analytically prolonging the solution for 
t > to + S. li this analytical continuation of the solution is no longer possible 
at a certain time t = ti, we say that the problem has a singularity at t = t\. 

At this point we need to clarify the nature of this singularity, since one 
might wonder if it could be eliminated in some way (changing variables or 
by some other trick). We ask ourselves if the singularity at time ti is due to 
a physical collision between two or more material points or to an unlimited 
motion in a finite time or, finally, to uncontrolled unlimited oscillations of 
some points. Here and in the following, by a “collision” at the time t = t\ 
we mean that, for t ^ ti, the position vectors of the N material points tend 
to well-defined limits of which at least two are coincident. 

The systematic analysis of the singularities for the A^-body problem was 
begun by P. Painleve, who in 1895 gave at the University of Stockholm a series 
of lectures on differential equations that at the time were exceptionally well 
received.^® Painleve was the first to raise the question if all singularities are 
due to collisions. Before illustrating his result, let us introduce some useful 
relations. First we calculate the quantities 

^^See, for instance, E. T. Whittaker: op. cit.. Chap. XIII. 

^®The lectures were published in 1897 - P. Painleve: Legons sur la Theorie Anal- 
itique des Equations Differentielles Professees a Stockholm (A. Hermann, Paris, 
1897). The iV-body problem constitutes the final part of the treatise (pp. 582- 
589). 
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Y^rrij {rj -Tkf = '^mjr^+Y^mjrl-2rk-J2mjrj, j,k = 1,2, N. 

3 3 3 3 

If the coordinates are referred to the centre of mass, the last term on the 
right is zero, so that 

Y^ruj {rj-Tkf = I + Mrl. 

3 



Multiplying by ruk and summing over k we find that rrik rrij = 

M I M I = 2/ M, which can be written as 

rrij rrik = I M. (3.38) 

^<j<k<N 

Let us define the following 



r (t) = miuj^k rjk{t), R (t) = max^y/, rjk{t). (3.39) 



Remembering definition (3.2) of the potential U, we soon obtain (in units 
such that G = 1) 



and since T,i<j<k<N 



E 

l<j<k<N 

rrij rrik < M^/2, we 



rrij rrik 
r 

also have U < M‘^ I2r. Moreover, 



^ ^ rrijmk ^ 

'f'jk '^jk 



V 1 < j. A: < A^, 



where ji was defined in (3.5), and, since at every instant at least one Vjk 
is equal to r, it will be also U > jr. There therefore exist two positive 
constants A and B {A = B ^ M^/2) such that Ar~^^ <U < Br~^, or 



AU~^ < r < BU~^, (3.40) 

so that U~^ represents an estimate of the minimum distance among the 
points in the system. 

Let us see now the role of the moment of inertia I. From (3.38) and (3.39), 
we have the following chain of inequalities: 



- M 






j<k 



j<k 



ruj rrik r% ^ \mB? 



3<k 



SO that 



< I < -MRr 
M - - 2 



Finally, 



<R< with A=J—, B = 



Vm 



(3.41) 



and thus ^/I is an estimate of the greatest separation between the points. 
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We now arrive at the results obtained by Painleve, quoting his statement 
of the theorem on the existence of the singularities in the A/'-body problem: 

Soit r{t) la plus petite des distances rij a Tinstant t. Si le mouvement est regulier 
pour t < ti mais point au dela, r{t) tend vers zero quand t tend vers t\\ c’est 
a dire que pour t suffisamment voisin de ti, r{t) reste inferieur a toute quantite 
donnee a I’avance 

That is, a necessary and sufficient condition for there to be a singularity at 
the instant t = t\ is that r ^ 0 for t -4 ti; therefore, from (3.40), we 
also have U oo for t ti. That the condition is sufficient follows from 
the definition of singularity; that it is necessary follows instead from the 
considerations at the end of Sect. 3.1. Incidentally, Painleve noted that the 
condition r — )► 0 for t — > does not necessarily imply that the singularity 
was a collision. In fact, it may happen that some of the distances Vij oscillate 
in such a way that 



lim inf Vij — 0, lim supr^^ > 0, 

whereas r 0; that is, r may tend to zero, and so there can be a singularity 
without a collision. Following a suggestion by Poincare, Painleve spoke in this 
case of pseudocollisions}^ In any case, he succeeded in proving a theorem 
relative to the three-body case: 

For N = 3^ all the singularities are collisions. 

The solution to the question posed by Painleve, for > 4 , was provided 
about a century later, as we will see at the end of this section. But, to follow 
strict chronological order, note that an important step forward was taken in 
1908 by Von ZeipeF^ with the following theorem: 

If some of the material points do not tend to a finite limiting position, 
for t ti, then necessarily R — cx), in the limit t t\, being R the 
greatest distance among the points. 

This means that, if the singularity is due to a collision, the system stays 
bounded and the only way to have a pseudocollision is for the system to “blow 
up” to infinity in a finite time. The conclusions by Painleve and Von Zeipel 
were taken up again lately and, after more than half a century, new results 
were obtained that further clarified the conditions in which a singularity 
occurs. The subject is further considered in a series of theorems, some of 
which will be stated here without proof, which would in general be quite 

^^Painleve: op. cit., p. 583. 

^^Painleve: op. cit., p. 588. 

Von Zeipel: Sur les singularites du probleme des n corps, Arkiv for Matem- 
atik, Astronomi och Fysik, 4, 32, 1-4 (1908). The theorem was redemonstrated 
in “modern” language by McGehee - R. McGehee: Von Zeipel’s theorem on sin- 
gularities in celestial mechanics, Expo. Math., 4, 335-345, (1986). 
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involved and “technical” . Since we are treating an autonomous system, and 
therefore can “translate” the solutions by changing the origin of the time 
axis, it is convenient to take ti = 0 and so always consider the limit for 
t 0. Let us moreover assume the following definitions: 

a) A function f(t) is said to slowly vary for t 0 if / f{t) 1 for 

t ^ 0 , with j3 an arbitrary positive constant. 

b) A function g{t) is said to regularly vary for t > 0, if g{t) = f{t)^ 

where / is a slowly varying function for t 0 and a is bounded from 
below by a suitable constant A. 

Let us now move on to the statement of the first theorem: 

If there is a singularity at the time t = 0 and if I is a slowly varying 
function for t 0, then the singularity is due to a collision. 

As a corollary of this theorem we have Von Zeipel’s theorem mentioned above: 

If / = O (1) for t ^ 0, then the singularity at the time t = 0 is a 
collision. 

The proof of the corollary is simple: if there is a singularity at t = 0, then, in 
agreement with Painleve’s result, U ^ oo for t -> 0. From the Lagrange- 
Jacobi identity (3.23), it follows that I oo for t ^ 0; therefore the 
function I (t) has an upward concavity and may tend either to a positive 
limit (/ > 0) or to infinity, for t 0. The infinite limit contradicts the 
hypothesis / = O (1); only / > 0 remains finite. If / 0 for t — )> 0, then 

-> 0, V /c as well, and therefore there is a collision. If I stays greater than 
zero for t ^ 0, then it is a slowly varying function and the theorem by Saari 
can be applied, so that again there is a collision. The corollary is then proved. 
Let us state the second theorem: 

If the N material points are placed on a straight line, then all the singu- 
larities are due to collisions. 

We may assume all the points on the x axis of the barycentric frame; as 
— 0, if we denote by x\ the coordinate of the first point (on 
the left of the origin) and hy xn that of the last one (on the right of the 
origin), then < 0 and xn > 0. Let us prove the theorem by finding 
a contradiction supposing that it is untrue. If it is true, from Von Zeipel’s 
theorem, the existence of a pseudocollision at t = 0 implies that / — )> oo 
for t — > 0 . From (3.41), as a consequence, R ^ oc too, for t to. Since 

the N points must always lie on the x axis and cannot change their relative 
order, R — \xn —xi\ and therefore also \xn —xi \ oo. From the condition 
J2i = 0, we see that xn does not have a limit for t ^ 0 (actually 

G. Saari: Singularities and collisions of Newtonian gravitational systems, Arch. 
Mech. Math., 49, 311-320, (1973). 

^^Saari: loc. cit., p. 319. 
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xn ^ oo for ^ > 0). In fact, supposing that this is not so and then that xn 
is bounded, from \xn — xi\ ^ oo we will have xi —oo . Since the centre 
of mass is fixed, this implies that 



N 

rrii Xi — —mi X\ oo. 

i=2 



But this means that there is some point Xi such that limsupx^ = oo. Since 
Xiv is to the right of all other particles, then limsupxiv = oo too, in contra- 
diction with what we assumed. Then, if the singularity is a pseudocollision, 
Xn does not have a limit. Let us now consider the question from another 
point of view. The equation of motion for mjv is 



rriN Xn — 



E 






rriN (xj — Xn) 
\xj - XivP 



Any point xi , with i = l,2, ...,A^ — 1, will be to the left of xjv , so that it 
will be always the case that xn <0: the function xn (t) will be concave 
downwards, and, as a consequence, xn will be limited for t 0, but this 
is in contradiction with what was just established, and so the theorem is 
proved. Let us now consider the third theorem:^^ 



A singularity at the time t = 0 is due to collision if and only if U ^ 
Oit~^l^ for t — > 0, for some positive constant a. 

The notation f ^ g used is intended in the sense that, if / and g are 
positive functions, after some time t there exist positive constants A and 
B such that Ag{t) < f (t) < Bg{t). Referring to the cited works for a 
proof of the necessity of this condition, let us see briefiy how the stated 
condition is sufficient. From the energy integral T — U = h and from the 
hypothesis of the theorem, it follows that T = O for t — > 0, from 

which = O that is, Vk = Vk = O ^ind, integrating between 

ti and t 2 (0 < < ^ 2 ), we have \vk {ti)—rk (^ 2 )! ^ A\t^!‘^ —t 2 ‘l‘^\ for some 

constant A. For ti and t 2 ^ 0, the right-hand side of this inequality tends 
to zero and the left-hand side does too; Cauchy’s criterion for the existence 
of the limit gives that every Vk tends to a limit for t 0. Therefore the 
singularity can be due only to a collision. Finally, we quote a result due to 
Saari^^ as the fourth theorem: 



The set of initial conditions leading to a collision in a finite time has zero 
(Lehesgue) measure. 

In other words, in an AT-body self-gravitating system, the collisions have a 
low probability of occuring. In summary, collisions, however improbable, are 

Wintner: op. cit., pp. 255-257. H. Pollard, D.G.Saari: Singularities of the N- 
body problem. I, Arch. Rational Mech. Anal.., 30, 263-269, (1968). 

G. Saari: Improbability of collisions in Newtonian gravitational systems, Trans. 
Amer. Math. Soc., 162, 267-271, (1971); 181, 351-368, (1973). 
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a considerable hindrance to the treatment of the A^-body problem, since, in 
general, we do not know how to exclude the initial conditions leading to one 
or more collisions. The only situation in which this is possible is in the case 
in which all the points collide at the centre of mass: this is the global collapse 
we will examine in the next section. 

Let us return as we said we would at the start of the section, to the ques- 
tion of the singularities that are not due to collisions. In 1974, McGehee^^ 
announced the construction of a non-collisional singularity in the rectilinear 
five-body problem; later on, Mather and McGehee^^ proved that there can be 
a noncollisional singularity in the rectilinear four-body problem. This result, 
however, must be intended (see Saari’s theorem, proved above) in the sense 
that this singularity occurs after an infinite number of (regularized) colli- 
sions. Finally in 1988, Xia^^ demonstrated the existence of a non-collisional 
singularity, not preceded by an infinite number of collisions as in the result 
by Mather and McGehee, in the spatial five-body problem. Xia considers two 
pairs, each constituted by two equal masses, plus a small fifth body. The two 
binary systems are on opposite sides with respect the xy plane and rotate 
in opposite directions. The fifth particle moves along the 2 ; axis (so that the 
total angular momentum is zero) and oscillates between the two binaries, de- 
termining an unlimited motion in a finite time. The system is represented in 
Fig. 3.1. The example given by Xia can be extended to cases with the same 
type of symmetry with any N > b. Another case of noncollisional singularity 
has been found by Gerver,^^ who considers 3n particles in the plane. 2n of 
them (all of the same mass) constitute n pairs covering circular orbits whose 
centres are at the vertices of a regular n-agon. The other n have very small 
equal masses and move from one pair to the other (see Fig. 3.2). When each 
of them moves near a binary, it gets kinetic energy from the pair and releases 
linear momentum, so that the binary becomes more bounded and its centre 
moves away from the centre of the n-agon. If one iterates the process, for 
suitable values of the initial conditions, the extension of the configuration 
will grow at each close encounter of a particle with one of the binaries and 
the system will become unlimited in a finite time. 



McGehee: Triple collisions in the collinear three body problem. Invent. Math., 
27 , 191-227 (1974). 

Mather, R. McGehee: Solutions of the Collinear Four Body Problem which Be- 
come Unbounded in Finite Time, Lecture Notes in Physics 38, ed. by J. Moser 
(Springer, 1975), pp. 573-597. 

Xia: The existence of non collision singularities in the Newtonian systems, Ann. 
Math., 135 , 411-468 (1992). 

L. Gerver: The existence of pseudocollisions in the plane, J. Diff. Eq., 89 , 1-68 
(1991). 
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Fig. 3.1 



Fig. 3.2 



3.4 Sandman’s Theorem 

As we have already said, for any N (the case N = S will be examined sep- 
arately), it is in general unknown how to exclude binary collisions. However, 
there is a celebrated theorem due to Sundman,^® whose content, incidentally, 
was already familiar to Weierstrass,^® that allows us to exclude the global 
collapse of the system. This theorem states that: 

The global collapse of the system may occur only in the case in which the 
total angular momentum vanishes. 

Let us, as a preliminary, establish an important inequality. From (3.24a), 
c = |c| = mfc Tk Vk sin i^k, 

k 

Ok being the angles between the vectors Vk and Vk\ from this we obtain 

c < TUfcrfcUfc |sin '&k\, 
k 

and, from Schwarz’s inequality, 

< E ruk rl E TRkvl sin^ dk^ (3.42) 

k k 

From (3.22) we then have, by differentiating with respect to t, 

/ = 2 mfc (rfc • Ufc) = 2 Vk cos 'dk, 

k k 

F. Sundman: Memoire sur le probleme de trois corps, Acta Mathematical 36, 
105-179 (1912). The results reported in this review had already been published 
in the Acta Societatis Scientiarum Fennicae, 34, 35. 

^^The letter by Weierstrass to Mittag-Leffler in which he speaks of this result (with- 
out proof) is published in Acta Mathematical 35, 55-58 (1912). 
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from which 

|/| < 2 ^ ruk Tk Vk I cos dk\ 

k 

and, again applying Schwarz’s inequality, we get 

P < ^Y^mkrl Y. rrikvl cos^ 'dk^ (3.43) 

k k 

Summing (3.42) and (3.43), we finally obtain 

+ P <Y'^krlY'^k vl, 

k k 

that is, 

^P + P<2IT, (3.44) 

which we call Sundman’s inequality. 

Let us now prove that, if the collapse occurs, it must occur in a finite time: 
that is, it cannot be the case that / — )► 0 for t ^ oo. In fact, referring to the 
expression of the potential [/, if every distance tends to zero {vjk 0) for 
t — )► oc, then U ^ oo as t oo. On the other hand, the Lagrange- Jacobi 
identity implies that / — > (X), since h (the constant energy) is finite. From 
a certain instant onwards, it must be the case that / > 1. Integrating twice 
yields 

where A and B are integration constants. This means that / ^ oo for 
t ^ oo. But this is the opposite of / — > 0 for t oo: the collapse must 
therefore occur in a finite time ti. 

We now give the proof of Sundman’s theorem. Assume that > 0 (the 
proof can obviously also be worked out in the case ti < 0 ). Then 

U — y oo , I — y oc , I — y 0 , for t — y 1 1 , 

in the case of collapse. Also / > 0, at least for a time interval t 2 < t < ti, 
moreover, since / > 0 and I {ti) = 0, in such an interval we also have / < 0 . 
Therefore, in the case of collapse, I{t) is a decreasing function. Sundman’s 
inequality (3.44) furthermore yields 

< 2IT. 

By substituting the Lagrange- Jacobi identity in this, we have 



Multiplying by the (positive) quantity — ///, we obtain 
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Integrating for t G [^ 2 ,^ 1 ) gives 

i (? In {l~^) < hi — + k, 

where k is an integration constant. Furthermore, 



Then, if / -> 0 for t ^ ti, c also must tend to zero; c being a constant of 
the motion, c = 0 always. With this the theorem is proved. 

Referring our mathematical model of N material points to the solar sys- 
tem, we find that the total angular momentum can be calculated with suffi- 
cient precision and can therefore be determined an invariant plane perpen- 
dicular to the angular momentum vector. Such a plane can be taken as a 
reference: for example, we can set the origin at the centre of mass and the 
2 ; axis along the angular momentum vector. A reference frame of this kind, 
already suggested by Laplace, did not find great favour among astronomers. 
It might be thought that, since the planets are not actually material points 
but extended bodies rotating around their own axes and therefore endowed 
with intrinsic angular momentum, the aforementioned plane, passing through 
the centre of mass, could assume various orientations, induced by subsequent 
changes of the rotation axes of the planets (precession, tidal friction, etc.). 
But, according to present estimates, about 98% of the angular momentum of 
the solar system is due to the orbital motion of the major planets (Jupiter, 
Saturn, Uranus and Neptune), and so this is not the case, and therefore the 
invariant plane remains effectively fixed with respect to the centre of mass of 
the system. 



o 2 / T 2 A: 

r 

- In (7-1) • 



3.5 The Evolution of the System for t 00 

Up to now, we have approached the AT-body problem with the disposition of 
one who wants to “solve” set (3.1b). We stated the conditions under which one 
can be sure that a solution does exist {existence theorem)^ we investigated the 
cases in which the theorem can(not?) be applied {occurrence of a singularity)^ 
and we reviewed all the possible first integrals or at least the eventuality that 
the order of the system could be reduced. At this point it is clear that, even 
in the most favourable case {N = 3) we are very far from believing it possible 
to integrate (3.1b) in the usual meaning of the term. Waiting for the study 
of particular solutions characterized by symmetry properties such as to allow 
their detailed analysis and the possible generalization to any N of some result 
we will find for N = 3, let us now see what can be said about the evolution 
of the system for t -T oc. 
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The problem is to try to assess the long time behaviour of the mutual 
distances of the material points and whether their number stays constant 
or not, assuming, at the same time, that the solution exists in the interval 
[0, oo). Such studies have been undertaken only recently, if we exclude the 
N = 3 case, studied in depth by Chazy^^ in the 1920s. In essence, apart 
some hints in the book by Wintner, the paper that, as it were, founded this 
field of research is that by Pollard in 1967,^^ followed by works by Saari and 
others. The bulk of the results depends on the application of the so called 
Tauberian Theorem^^^ whose importance for celestial mechanics Wintner had 
already suggested. The work by Pollard provided a classification of the final 
motions of a system of N material points on the basis of the sign of the total 
energy and it was shown that the behaviour of Wbody systems at non- 
negative energy was essentially a generalization of the two- and three-body 
problems. 

In 1970 Saari, again taking Pollard results, gave a classification of the 
final motions that did not depend on the sign of the energy. Moreover, he 
showed that, in the absence of so-called oscillatory or pulsating motions, the 
system evolves in a regular fashion: the N bodies divide into groups, inside 
which the mutual distances among the components are limited for t oo. 
These groups form subsystems characterized by a separation going as 
in turn, the centres of mass of the subsystems asymptotically separate from 
each other as ct^ where c is a constant vector. In the following, we give 
the results summarized above: the treatment is extremely sketchy, for most 
theorems we simply state them, referring to the original works for the proofs. 

The Behaviour of r and R for t oo 

For the minimum distance r the following is relevant 

Theorem 1: If h is negative, r stays limited. If h = 0, then r = O for 
t oo. If h > 0, r = O (t) for t -> <30. 

Chazy: Sur Failure du mouvement dans le probleme des trois corps quand le 
temps croit indefinitement, Ann. Sci. Ecole Norm. Sup., 39 , 29-130 (1922). 

Pollard: The behavior of gravitational systems, J. Math. Mech., 17 , 601-611 
(1967). 

^^The theorem by Tauber (1897) states that: if the series ao + aix -h U 2 X^ + . . . 
converges for \x\ < 1 and its sum converges to ^ for x ^ 1, then ao + ai -h U 2 + 

. . . converges to £ if nUn is infinitesimal for n -> oo. In general we say that a 
theorem is “Abelian” if it asserts something about an average of a sequence from 
a hypothesis on its ordinary limit; it is “Tauberian” if conversely the implication 
goes from average to limit. By extension theorems concerning integrals are also 
called Tauberian, being the continuous analogs of the series. See, D. V. Widder: 
An Introduction to Transform Theory (Academic Press, 1971), Chap. 8. 

^^A. Wintner: op. cit., pp. 428-429. 

G. Saari: Expanding gravitational svstems, Trans. Amer. Math. Soc., 156 , 219- 
240 (1971). 
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If < 0, the energy integral will be T = U — \h\^ and since T > 0, 
U > \h\ as well so that, by (3.40), the minimum distance r remains limited. 

Let us look at the cases h = 0 and h > 0. From the definitions of U 
and r , differentiating and using (3.40) we get 

\il\ < A u'^ ^ (mjTnk)^^^ \r-jk\, 

l<j<k<N 

SO that, using Schwarz’s inequality, 

|{/p < A ^ rrijmkrjk^. 

l<j<k<N 

Since the centre of mass is at the origin, 

rujmkfjk^, 



and 

|f/|2 < AU^^T, 

and therefore 

\il\ < AU'^T^/'^ (3.46) 

(in the preceding lines A indicates a positive constant, differing, in general, 
from the original one). Setting q = substituting in (3.46) and using the 
energy integral, we obtain 



lel < A. 




(3.47) 



If h = 0 , then |^| < A; integrating this yields < At, or < 

At so that, by (3.40), we obtain r = O (t^^^), as was to be proved. 

Now consider the case h > 0. We introduce the definition p = g when 
^ > 1 and, p = 1 when g < 1. For ^ > 1, (3.47) immediately implies that p 
is limited; in each interval in which g < 1, on the other hand, p = 0, p being 
constant. Therefore p is limited everywhere, so that p = 0(t). As g < p^ 
it follows that g = 0{t), or U~^ = O (t) and then r = O (t). This theorem, 
therefore, provides upper bounds on the increase of r, whereas nothing is 
known about the lower bounds. Equally, it is possible to obtain information 
about the lower bounds of R, but not on its upper bounds. 



Theorem 2: 7/ A > 0, R > At; z/ A = 0, R > At^!^ . 



If A > 0, from the Lagrange- Jacobi identity, since U is always positive, 
we have I > 2h. A twofold integration gives / > At‘^ so that, from (3.41), 
R > At {A always indicates a positive constant). If /z = 0, theorem 1 
implies that U > so that also I > At~‘^^^. A twofold integration 

gives I > At^^^. Inequalities (3.41) readily give R > This procedure 

cannot be applied in the case h < 0. 
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Relations Between r and R 



We will now try to establish some connection between the behaviour of r 
and R. The following holds. 

Theorem 3: If R = 0{t) for t oo, then r is limited in the mean from 
below; that is: 

r{u) du > A > 0, V t > 0. 




From Taylor’s expansion 

.. 

I{2t) = I{t) + tl{t) + - 7(0, t<i<2t, 

and from the Lagrange- Jacobi identity (from which 7 > 2 /i) it follows that 

I{2t) >ti + ht^ (3.48) 

always. As by hypothesis, R = 0(t), from (3.41), we have 7 = O(t^). From 
this and (3.48), we deduce that ti < At^, or i < At. Integrating the 
Lagrange- Jacobi identity gives 





du -\- 2ht -\- const. 



As I < At ^ this yields 
and, from (3.40), 



[ U {u) du < 

Jo 



f 



At 



du 

r{u) 



< At. 



This last equation and Schwarz’s inequality imply that 



0 e(OJ 



1 < / r(u)du [ ■ < At [ r{u)du. 

J Jo Jo ^(^) Jo 



With this the theorem is proved. 



Theorem 4: There exists a time t^ , depending on initial conditions, such 
that: , 



I < t VU 



t>to. 



(3.49) 



We observe that, if 7 q is the initial value of 7 for t — 0, the relation 

)du — ht^ 






(3.50) 
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is valid, since it is obviously true at t — 0 and can be verified for t > 0 
by differentiating. In fact (3.23) is obtained. Since U~^ < ^4r, as seen in 
theorem 1, t/ is always limited from below for t — > oc. This implies that the 
integral in (3.50) diverges for t ^ oo; therefore, there will exist a to such 
that, for t > to, 

(3.51) 

But, from Sundman’s inequality (3.44), we also get 
so that 

- / — t a/2 it < —ht^ ^ Vt>to. 

Adding t^ T to both sides of this last inequality and remembering that 
T —U = A, we obtain 



<ut\ yt>to, 

which implies (3.49). As a consequence, we can prove the following theorem. 
Theorem 5: For t — )► oc, j -> oo if and only if r — 0. 



Assume, from the start, that r 0. Then, from (3.40), C/ — > oo and, 
from (3.23), / — > oo. This implies that, VL > 0 , there exists a ti such that 
I > L, \/t >ti. Integrating twice and dividing by we have 



liminf ^ - L, 

- 2 ’ 



and, since L can be chosen arbitrarily large, I /t^ oo so that, from (3.41), 
R/t — y oo. 

Suppose now, instead, that -> oc; by (3.41) therefore oo, 

and, from (3.49), theorem 4, it follows that 



Vu + Vf OO. 



As T — U + then U oo also and r ^ 0. 

Observe, at this point, that the catastrophic event R/t oo cannot be 
due to the fast “explosion” of all the distances Vjk^ it not being possible, that 
is, that Tjk/t oo for every mutual distance Vjk. This would imply, in fact, 
that t/vjk 0 for every pair (j, fc), so that tU ^ 0, and, moreover, r/t ^ 
oo. But this would be in contradiction with what was proved in theorem 1. 
On the other hand, we cannot exclude that lim sup {vjk/t) = oo for every 
pair of points. 
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Coming back to the hypothesis of theorem 3, namely R = O(^), and 
therefore also = O(t^), it is obvious that, with greater reason, (since 
r < R), 

i?2^ = 0(C), t-^oo. (3.52) 

This can be proved with the following theorem. 

Theorem 6: Estimate (3.52) is always valid; moreover, except when h > 0 
and r is unlimited, 

r = O(t^), t — )► oc. (3.53) 

We omit the proof, for the sake of brevity. 

Null Energy and Positive Energy Systems 

For h = 0, there are exact results only in the cases N — 2 and N — 3. In 
the first case the motion is parabolic and (2.46) hold. From the first of these 
we have that, for large t, r goes as u‘^] from the second we then have that 
goes as t, so that r = At^!"^ with ^4 > 0, for large t. For N — 3, Chazy^^ 
showed that there are two possibilities: 

r -- R Af^^ 

or 

r ~ 0(1), R ^ At. 

For any N the following can be stated. 

Theorem 7: If h = 0, then either 

U ~ I ~ 

4 

for some positive constant a, or r = 0{t^^^) and lt~^!^ oc. 

For the proof, exploiting a nonlinear Tauberian theorem, we refer to the 
already cited paper by Pollard. 

In the case h > 0, fundamental results were obtained by Chazy.^® He 
proved that the limit 

r 

lim — = L 

t->oo K 

always exists and is finite. Moreover, if L = 0 , it holds that 

(3.54) 

R 

In the case L 0, a complete and exact theory about the behaviour of the 
systems can be obtained, and, in particular 

Chazy: loc. cit. 

Chazy: loc. cit., Footnote 30 
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~ y, / - hf. 

In the case L = 0, however, it can be inferred from (3.54) only for N = 3 
that 

r = t ^ oo, (3.55) 

holds. 

Up to now we have considered results by Chazy. Pollard proved that (3.55) 
always holds when L — 0, even if N ^ 3. In addition, he obtained some new 
results, starting from the following lemma, which we only state. 

Lemma: Whatever the sign of the energy, the limit 

lim (3.56) 

always exists. If i = 0, then (3.55) holds. 

This lemma allows us to prove the first part of the following theorem. 

Theorem S: If h > 0, then either U ^ a/ 1 for some positive constant a, 
or r — 0(t^/^). In the first circumstance, I = ht“^ at \nt 0{\nt), for 
t oo; in the second I > ht“^ At^^^. 

Obviously the second part follows from the first; integrating twice and 
the use of the Lagrange- Jacobi identity is sufficient. 



Escape from the System 

In the case for which /i > 0, integrating the Lagrange- Jacobi identity (3.23) 
twice, gives / oo for t ^ oo. As a consequence, lim supr^ = oo, for 
some k. However, this does not necessarily imply the escape of a particle from 
the system; this occurrence is warranted only if r/e oo, for some k, so that 
further conditions must be added. In this respect there is the following. 

Theorem 9: If h > 0 and U^{u)du < oo, then at least one particle 
leaves the system. 



From the equations of motion (3.1a), we have 



Tk\ < 






Integrating this gives 

\vk{ti) - Vk{t 2 )\ < A" [ U^{u)du. 

Jti 

For ti and t 2 -> oo, the right-hand side of this inequality vanishes; the same 
must happen for the left-hand side. For every k then, the finite limit 
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lim Vk = t-k 

t— >oo 

exists. As Vk = ^k, we deduce that Vk/t ik^ for each A:, and similarly 
for the absolute values Vk/t Ik’ It remains to show that the ik are not 
all zero, that is, that at least one of them does not vanish. Actually, if they 
should all vanish, then Vk 0, V/c, and then T 0 and thus U — /i, 
which is impossible, U having always to be positive. 

Theorem 10: If h > 0, in the case for which U ^ ajt, with a > 0, at least 
N — 1 particles leave the system. 

The first part of the proof (that is, the existence of 

lim Vk = £k 

t-^oo 

for each k) proceeds exactly as in the previous case. It remains to prove that, 
now, at most only one of the Ik can be zero. If two were vanishing, e.g. Ii = 0 
and £j = 0, we would have — )> 0 and rjjt 0, so that rij/t 0 and 

then {tU)~^ — > 0, which contrasts with the hypothesis U ~ a/t. 

Theorem 11: If I ^ oo, for t oo , and the condition 




is satisfied, then at least two particles leave the system. 

We again emphasize that we speak of the escape of a particle from the 
system only when its distance from the centre of mass tends to infinity, when 
t ^ oo . We say that the system disintegrates or is dispersive when all the 
mutual distances become infinite. It has never been proved that a particle 
must escape a disintegrating system, since it is not true that a system from 
which all particles escape, disintegrates. A classical counterexample is that of 
the three-body problem (the hyperbolic-elliptic case) where a pair of particles 
“runs” away in one direction and the third particle in the opposite direction. 

Final Motions 

Generally speaking, we can classify the final motions, that is, those for t 
cx), in three broad groups: limited motions, oscillatory motions and unlimited 
motions. By limited motions we mean those for which: 

a) rij{t) ^ 0 for every i ^ j and any t; 

b) with A a positive constant, for any t. 

In this case, therefore, the system remains confined to a limited region of 
space; the motion is also said to be “stable”. For this to happen, the total 
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energy {h) must be negative (Jacobi stability criterion). The proof is straight- 
forward. We have already seen that, for /i > 0, integrating the Lagrange- 
Jacobi identity yields I — > oo, and therefore, from (3.38), it cannot always 
be the case that \rij{t)\ < A, \fi ^ j. Obviously the condition is only 
necessary; therefore h < 0 is not sufficient to warrant the ‘‘stability” . 

Let us now introduce the definition of oscillatory motion in the case of 
any as a generalization of the case A^ = 3. In the three-body problem there 
can be initial conditions that lead to motions with the following properties 
for t ^ oo 



lim sup 


^23 — 


oo, 


lim sup 




) >0, 


V^23y 


/ 


lim inf ( 




= 0. 


\ 







In the case of any N whatsoever, the masses nik^ rrij^mi undergo oscillatory 
motion if, for t — )> oo. 



lim sup 




00, 


lim sup 


( 


) >0, 






/ 


lim inf ( 




1 =0. 


\ 


^rij) 





Finally, there will be unlimited oscillatory motion if — > oo and r 0; if 

this happens in a limited interval of time, we speak of unlimited motion in a 
finite time; otherwise we speak of super-hyperbolic expansion. 

The table aside^^ lists the various possible types of final motion. It must 
be noted, however, that such considerations cannot be taken as definitive for 
all cases. 

The systems with a hyperbolic expansion separate into two or more sub- 
systems whose mutual distances grow as t, whereas the distances between the 
components of the subsystems go, at most, as For every subsystem, the 
energy and the angular momentum tend to a limit; for the energy, the limit 
will be negative or zero; in this last eventuality the subsystem will tend to 
a central configuration (see Sect. 3.8). The central configuration will be the 
limit in other cases too: motions of the A^-parabolic type or triple or multiple 
collisions. 

^^The existence of oscillatory motions in the three-body problem Wcis conjectured 
by Chazy (see Footnote 30) to account for the fact that, for A < 0, I can at 
the same time be limited and unlimited for t — > oo. The proof was given in 1960, 
by Sitnikov - K. Sitnikov: The existence of oscillatory motion in the three-body 
problem, Dokl Akad. Nauk. SSR^ 133 , 303-306 (1960). 

^^The table, due to C. Marchal, is in C. Marchal; Qualitative analysis in the 
few body problem. Proceedings of the 96th Colloquium of the lAU^ ed. by 
M. J. Valtonen (Kluwer, 1988), pp. 5-25. 
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Final motions in the V-body problem 


Singular 

types 


Triple or multiple collisions R ^ Rp; r ^ 0 
Unlimited expansion in a finite time R ^ oo; r 0 
Super- hyperbolic expansion R/t ^ oo; r 0 


Hyperbolic 

expansion 


Tj = Ajt + 0(t2/3); 

from 2 to V subsystems, R ^ t^ Mj 


Parabolic 

and 

sub-parabolic 

types 


V-parabolic types = 0; R,r ^ t^!^ 

Parabolic types, h R — r limited 

subsystems, < 0, = 0(^^/^), r limited 

limited motions, h R — 0(t^/^), r limited 

oscillatory motions, < 0, = 0(t^/^), r limited 



3.6 The Virial Theorem 



Sometimes it happens that theorems, formulated in a given context with the 
aim of providing a theoretical assessment of some specified ideas, find ex- 
tensions and applications in fields apparently very different from the original 
one. This is the case for the so-called virial theorem. With this theorem, pub- 
lished in 1870, Clausius^^ intended to arrive at an almost definitive statement 
about a long time of research on the mechanical origin of heat. Very soon, 
however, its importance was appreciated in applications to the V-body prob- 
lem, particularly with regard to stellars dynamics. It should be emphasized, 
however, that the theorem differs, in some features, from typical mechanical 
theorems, because Clausius introduces a concept with a statistical nature: av- 
eraging over time. This fundamental feature is not always taken into account, 
and, as a consequence, it may happen that the virial theorem as described 
by some authors is mistaken for a particular case of the Lagrange- Jacobi 
identity (3.23). 

To demonstrate the theorem, we shall repeat the procedure already fol- 
lowed to demonstrate (3.23); we start from system (3.1b), which we again 
write as follows 



mkVk = 



dU 

dvk 



(3.57) 



Clausius: On a mechanical theorem applicable to heat, Phil. Maq. Ser. A, XL, 
122-127 (1870). 

Eddington: The kinetic energy of a star cluster, Mon. Not. Roy. Astron. Soc. 
76, 525-528 (1916); J. Jeans: Problems of Cosmogony and Stellar Dynamics^ 
(Cambridge University Press, 1919), pp. 188 ff.; H. Poincare: Legons sur les Hy- 
poteses Cosmog oniques^ (Hermann, Paris, 1913), p.94. 
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Taking into account the identity 



1 ^ 
2 dt^ 



(vk ■ rk) = |r-fc|^ +rk-Tk 



and multiplying (3.57) by rk, we obtain 



1 ^ 
4 dt^ 



{rukrl) = \mk \vkf + \'>'k 



du 

dvk' 



Summing over k from 1 to N, we get 



1 ^ 
4 dt^ 



N.N 



k=l 



k=l 



Y,^k- 



du 

dvk 



1 d? I _ 1 A dU 

~4 dfi ^ 2 ^ dvk ' 

k=l 



(3.58) 



We now introduce the idea of time averaging. Given a quantity A (t), its 
average over the time interval (0, t) is given by 



{A)t = \ f A{t) dr. (3.59) 

^ Jo 

For t tending to oo, we also have 

{A)oo = lim 1 /" A{t) dr. (3.60) 

t-s-oo t Jo 

In typical applications, average (3.60) is replaced by that defined by (3.59), 
taking into consideration “very large” times. As we shall see, in some cases, 
matters are very tricky and erroneous extrapolations may be generated. 

If now, in (3.58), we average over a time t, we get 



1 i{t) - j(o) 

4 t 




(3.61) 



In the case for which our A^-body system is subjected to a periodic motion 
of period t , it is clear that, for t = t or multiples. 






Y^rk 



dU 

dvk 



t 



The same result may be obtained, however, for any motion (not periodic), 
when i{t) — /(O) is negligible compared with a very large time (at the limit, 
infinite) in the denominator. The result will be exact when 



lim = 0. (3.63a) 

t-AOO t 
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Then 




holds if and only if (3.63a) is valid. Equation (3.63a) can be rewritten as 
i{t) = o(t), t ^ oo, or I (t) = o(t^), t oo; again, for (3.41), (3.63a) is 
equivalent to 

R (t) = o(t), t ^ 00. (3.63b) 

The quantity on the right in (3.62) was called by Clausius the virial The 
content of the theorem is then as follows: 

The average total kinetic energy (over an infinite time) equals the virial 
of the system if and only if (3.63a, b) is valid. 

As one can see, condition (3.63a,b) is far weaker than the condition that the 
system be bounded, which, as a rule, is given as necessary for the validity of 
(3.62); for instance, R ~ satisfies (3.63a,b) and thus (3.62) is valid. 

If now, as in the case of the Lagrange- Jacoby identity, we apply Euler’s 
theorem, taking into account that U is the Newtonian potential, we have 

{T)oo^\{U)oo, (3.64) 

which is the expression of the virial theorem in the case of the Newtonian 
A/'-body system. Applying energy conservation (3.21) and taking into account 
that {h)oo = h, h being constant, we also get 

(r)oc = -h, (3.65) 

(C/)oo = -2h, (3.66) 



which are equivalent to (3.64). 

It was shown by Milne^^ that (3.64) maintains its validity even if dissipa- 
tive forces are present, if these are proportional to the velocity of the mass 
points involved. If we add to the right of (3.57) a term —Ck Vk (where Ck is 
a positive constant), we have, applying the same procedure as above, instead 
of (3.58) 



1 d^ I 1 

4 4 dt 



N 



k=l 



1 ^ 

k=l 



dU 

'^k ’ o 

OVk 



For 



lim 

t — >^CXD 



Eti Cfe [rl{t)-rl{Q)] 
t 



= 0 , 



(3.67) 



one again obtains (3.64) for averages over an infinite time. It is clear, however, 
that (3.67) is a stronger condition than (3.63). In fact, now R‘^{t) = o{t) 



A. Milne: An extension of the theorem of the virial, Phil. Mag. Ser. d, 50 , 409- 
414 (1925). 
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must be true. Dissipative forces proportional to the velocity can then modify 
the system, by decreasing the value of A, but not the relation between the 
averages of T and U . 

Before discussing the use of the virial theorem in the case of large N 
(dynamics of stellar systems), we shall first take a look at how to apply it 
in the two body-problem. It appears from (3.65) that the case h > 0 does 
not fulfil the conditions of the theorem; we shall therefore consider only the 
elliptic and parabolic motions. Since the first case refers to periodic motion, 
we can confine our consideration to the period t = 27r/n, where n is the 
mean motion. The equations to be taken into consideration are (2.50) and 
Kepler’s equation (2.53). From the definition of the eccentric anomaly 



we obtain for the potential 



Li , 

— at = r du^ 
a 



U {t) dt = — dt — y/JTd du, 



and 



1 1 u(f) 

{U)t^- / U{r)dT=- / ^du = ^^. 

t Jo t Jq t 

Taking as a reference the apse line (T = 0), Kepler’s equation becomes 



nt — u — e sini^, 
and, by substituting it in (?7)t, we shall have: 





(3.68) 


a u — e smu 


')t=h + {U)t, we get 




H {u + e sin u) 
^ 2a u — e sin u 


(3.69) 



For t = t — 21^ jn or multiples thereof, sinu = 0, and then 



{U)j = 



a 



{T)t = 



ii 

2a’ 



which, since h — —fi/2a^ coincide respectively with (3.66) and (3.65), with 
t or multiples of t instead of cx). 

The case h = 0 (parabolic motion) must be handled with greater caution. 
First of all, for any finite t it will be always the case that {T)t — {U)t: since 
h = 0. This could lead to the conclusion that the virial theorem is not valid, 
since for t however large (3.64) could never be approximated. It is not like 
that, however. Let us, then, calculate {U)t and {T)t explicitly. From (2.46), 
referring also in this case to the apse line, we have 
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and, recalling that y/Jl dt — r du^ we get U {t) dt — du^ so that 
u (i) _ n ^ 2/3 



{U)t = vn 



6 ^ ^ 2/i 



t — y oo. 



Therefore 

lim {U)t = 0, (3.70) 

t-^oo 

and also 

lim (T)t = 0. (3.71) 

t-^oo 

Both (3.70) and (3.71) satisfy (3.65), (3.66), since h = 0. Thus in the 
parabolic case as well the virial theorem is valid. On the other hand, since 
asymptotically {U)t ^ also R ~ and consequently condition 

(3.63b) is satisfied. The seeming contradiction lies in the fact that whenever 
one looks for an approximate verification of the theorem, considering the val- 
ues of {T)t and {U)t for finite t, one has, for the averages, values equal 
and different from zero in disagreement with (3.64). The passage to the limit, 
stated in the theorem, on the other hand, allows us to verify that (3.65) and 
(3.66) are equivalent to (3.64). If the case N > 3 is considered, instead of 
N = 2, from what we have said in the preceeding section, the virial theorem 
is satisfied for motions characterized by h < 0 ( parabolic and sub-parabolic 
motions) . 

As we have said, the most typical application of the virial theorem refers to 
the case of very large N. We shall now seek to outline this case and thereafter 
pinpoint what the risks of careless application can be. It is clear that, for any 
application, averages over a finite time, for instance the real age to of the 
system, should replace (T)oo and (/7)oo; this, however, is not yet sufficient, 
since we cannot verify the soundness of the hypothesis 

(T)to ~ (7^00, {U)to ~ (t^)oo 

in the cases which one can actually meet in nature. What we can verify, 
albeit with a very rough approximation, is wether the system is in a steady 
state, i.e. if it is confined to a given region of space, and the positions and 
velocities of its mass points have a stationary distribution. Let us try to 
clarify this concept. We assume that our system may be represented in a 6- 
dimensional Euclidean space R® , by introducing a correspondence for every 
sextuple {xk^Vk^ ^fc), which gives the position and velocity of a point 

mass rrik^ with a point of the space so defined. If the system is kept bounded 
in ordinary space, the velocities will also remain bounded, and consequently 
the representative region of the system in R^ will be bounded. Let us split 
this region into a suitable number of small 6-dimensional cells; if, as time 
passes, the representative points in each cell remain constant in number, we 
may conclude that the distribution of the positions and velocities has not 
changed and that therefore the system is in a steady state. 
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It is immediately apparent that a system can be considered steady or not 
depending on the dimension chosen for the side of the cells. The cells must 
be large enough to contain a sufficient number of representative points to 
provide meaning for a statistical approach, and at the same time, however, 
there must be a sufficient number of cells to allow evidence to be given for 
possible evolution of the system. It is clear, therefore, that the steadiness of 
the system is a concept that is strictly dependent on the “graininess” assigned 
to it. This being the case, if we judge that a system has reached a steady 
state, then its total kinetic energy and total potential will stay practically 
constant in time (with the chosen “grain”). The adverb “practically” means 
that, in fact, both T and U will fluctuate, with fluctuations of the order of 
about flxed values T and U. We shall use these values T and U in 
the form {T)oo and (I/)oo and we then say that the system is virialized. 

Let us now deflne, at a certain instant of time, 

^ Ef- ■">)’ 

' ^ mj-TTlfc 5 ^ ’ 

\rj-Tk\ ^k=l 



then let us take 

1GM2 
“ 2 f ’ 

with, as usual, M = In the same way. 



\vkf = 

k=l 

We adopt for the virial theorem the form 



2T=U. 



(3.72) 



Then 



and 



— 1 

Mv^ = -G^ 

2 r 



2rv‘^ 

~G 



M = 



(3.73) 



Relation (3.73) as a rule is applied to evaluate the total mass of a stellar 
system (globular cluster, galaxy, etc.) considered, within a given approxima- 
tion, stationary and consequently virialized.^^ Equation (3.72), by which we 
“practically” express the virial theorem, hides, if not interpreted with due 
care, a trap. Let us see what that trap is. 



^^For the technical details on the evaluation of r and and for a comparative 
discussion with the results obtained by the mass-luminosity ratio, we refer the 
reader to C. W. Saslaw: Gravitational Physics of Stellar and Galactic Systems 
(Cambridge University Press, 1985), Sect. 41. 
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Suppose that, rather than considering T and U fluctuating around the 
values T and 17, we think that “really” 2T{t) = U{t) for any non-degenerate 
interval of time. In this case the following theorem is valid: 

If 2T = U for any finite time interval, then T = ~h, U = —2h, U = 
0, 7 = 0, I{t) =7(0), VT 

From energy conservation, it follows immediately that T = —h, U = —2h, 
and then f7 = 0. Note, by the way, that in this case h must be negative. 
By inserting the results in the Lagrange- Jacobi identity, we get I{t) = 0. 
Integrating this yields 

7(t) = 7(0)t + 7(0), Vt. 

If 7(0) ^ 0, then, either for t oo or for t — oo (depending on the sign 
of 7(0)), I{t) — oo as well. This contradicts the fact that 7 > 0 always. 
It follows, then, that 7 (0) 0, and therefore I{t) = 7(0), Vt. Furthermore, 

we can demonstrate the following corollary: 

If 2T = U over a finite time interval, then, V t , 



G rui ruj 

~2\hT 



< Tij {t) < 



mi y mj 



(3.74) 



The result is that the components of the system can neither get closer than 

a given limit nor leave the system itself. Let us see a demonstration of this. 

First ^ 

G mi mj 



from which 

In the same way 
from which 



2|hl = [/ > 






GmiTTij 

/(O) = I{t) > rtiirl. 



n{t) < 



mi 



rj{t) < 



im 



and, owing to the triangle inequality, 



ra < 



m 

mi 



m 

mj ' 



A typical motion which satisfies (3.74) is provided by a relative equilibrium 
solution (see Sect. 3.7). One can further introduce the hypothesis'^^ (not yet 

^^The above considerations and the theorem are due to D. Saari. See D. Saari: On 
bounded solutions of the n-body problem, in Periodic Orbits, Stability and Res- 
onances, ed. by G. E. O. Giacaglia, (Reidel, 1970), pp. 76-81. 
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demonstrated) that the motions which satisfy (3.74) come without exception 
from relative equilibrium solutions. In any case, it appears, from what we 
have shown that a “naive” interpretation of (3.72) leads us to approximate 
the system under study with a model that: 

1) has negative energy, 

2) satisfies (3.74); 

i.e. with a model characterized by very peculiar features. 

Let us add that, when the system is studied in the Hamiltonian formalism, 
the virial theorem is formulated as 




the virial being given by 




3.7 Particular Solutions of the AT-Body Problem 

Since a general solution of system (3.1a,b) cannot be given, great impor- 
tance has been attached from the very beginning to the search for partic- 
ular solutions where the N mass points fulfilled certain initial conditions. 
Historically, mainly with Euler and Lagrange, the work has centred on the 
three-body problem and thus on the exact solutions that could be obtained 
with particular symmetry properties. In the celebrated paper we have already 
mentioned, Lagrange demonstrated that, in the case of three bodies of fi- 
nite mass undergoing mutual Newtonian attraction, as many as five different 
configurations exist such that, under suitable initial conditions, the ratios 
of the mutual distances stay constant. In all these configurations the bodies 
describe similar conic sections around the common centre of mass: in three 
configurations the bodies always lie along the same straight line, while in the 
other two they are at the corners of an equilateral triangle. These configura- 
tions are called, respectively, the collinear solution and equilateral solution; 
the first one had been previously found by Euler. For N > 3, there are then 
solutions which extend to any N the Euler and Lagrange solutions: these are 
homographic solutions. This is the topic which we will deal with in this sec- 
tion, postponing to the section devoted to the three-body problem a detailed 
study of the Euler and Lagrange solutions. First of all, following Wintner,^® 
we list some definitions and considerations. 

^^See Footnote 8 in Sect. 3.2. 

^^L. Euler: De motu rectilineo trium corporum se mutuo attrahentium, Novi Comm. 

Acad. Sci. Imp. Petrop. 11 , 144-151 (1767). 

Wintner: op. cit., Chap. V. 
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Planar Solutions 

We call a solution of the problem of A^-body motion planar^ if a plane 77* 
containing all the bodies at any time exists and furthermore has a time- 
independent fixed position with respect to the inertial barycentric system. 
One can immediately see that, in the case of c ^ 0, i.e. when the invariant 
plane exists, then 77* coincides with the invariant plane. In fact, if 77* exists, 
it contains the centre of mass, and we can take it as the xy plane of the 
barycentric inertial system; since the motion occurs entirely in this plane, all 
the Zk and the velocities Zk will be zero as well as the components Cx and Cy 
of the total angular momentum. Therefore |c| = Cz at any time: the plane 
77* is at any time perpendicular to c and coincides with the invariable plane. 
Obviously, planar solutions may even exist in the case c = 0. 



Flat Solutions 

We call a solution of the Wbody problem flat if at any time a plane IJ = II (t) 
exists which contains all the bodies at that time. Of course, every flat solution 
is not necessarily planar: for N = 3 this is trivial, since every solution is 
flat. For examples of non-planar flat solutions for > 4 , see Wintner.^^ 
For the case where the invariable plane does not exist (c = 0), the follow- 
ing proposition is true: 

If the invariable plane does not exist, any flat solution is necessarily pla- 
nar. 

It must be stressed that the condition c = 0 is a sufficient condition for 
the planar motion, but not necessary. From the proposition one obtains as 
a corollary that, in the case N = 3, the condition c = 0 implies that the 
motion occurs in a fixed plane. Referring to Wintner^^ for the demonstration 
for any AT, we give below the demonstration for 77 = 3. 

Let us consider the plane that contains the three bodies at an instant 
t = to. Since the centre of mass also obviously belongs to this plane, we 
can take it as the fixed xy plane of our inertial barycentric system. If the 
three-body plane does not remain fixed, it will rotate around the centre of 
mass, and then at the time t = to^ although Zk = 0 {k = 1, 2, 3), in general 
i/c 7 ^ 0 (fc = 1, 2, 3). Therefore, the conservation of angular momentum (equal 
to zero) 'f'k X ~ 0, gives us, at t = to 

3 3 

mk Vk Zk=0, ruk Xkh = ^ 

fc=i fc=i 



^^A.Wintner: op. cit., note to Sect. 325. 
Wintner: op. cit., Sect. 326. 
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and furthermore the centre of mass being at rest, we get 

3 

'^mkZk = 0. 

k=l 

We then have a homogeneous system in the three unknowns miii , m2Z2, 

yi (mizi) + y 2 {rri 2 h) + ^3 {rn^h) = 0 , 
xi (miii) + X2 {m2Z2) + Xs (msis) = 0, 
rriizi + m2Z2 + m^zs = 0 . 

As is known, we can have: 
a) the trivial solution 

miZi - m2Z2 = mszs = 0 , 



or 

b) infinite solutions 0 if 



yi V2 ys 

Xi X 2 Xs 

1 1 1 



= 0 . 



As the latter is the condition that implies that the three points lie on a 
straight line at the time t = to, we are allowed to rotate the system around 
this line (which obviously contains the centre of mass) in order to have zs — 0 . 
Since the case ii =0, Z2 = 0 , zs = 0 is already included in solution (a), all 
that is left is (from the remaining equations) yi — y2-> x\ = X2. However, this 
case, the point P\ coinciding with P2, contradicts the assumption made at 
the beginning. It is therefore demonstrated that the only acceptable solution 
is zi = Z2 = Zs = 0 . The existence and uniqueness theorem then warrants 
that our plane stays fixed. 



Collinear Solutions 



We say that the N mass points are in syzygy, or are collinear, at a certain 
instant t — to, if at that instant they all lie on the same straight line. 
Obviously, this does not ensure that they will remain there for t ^ to. 
However, one can demonstrate that, if the invariable plane exists, then at 
the time t = to that line lies in the invariable plane. In fact, since at 
t = to every pair of points is collinear with the origin, at that instant 
^2 X = O7 V i,k = 1,2, ...,A'. Scalarly multiplying by Vi = f^, also 
Vi X Vk ’ Vi — 0, and then n x • Vi = 0. Finally, summing over i we get 




• r/c = c • r/, = 0. 
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Since c • = 0, V A:, is, for c ^ 0, the equation of the invariable plane, our 

thesis is thus proved. 

If all the N points always remains on the same straight line A = 7l(t), 
which, however, may vary with time, we say that we have a collinear solution. 
A collinear solution is obviously flat; according to what we have seen above, 
at any instant, A{t) (which, for instance, may rotate around the barycentre) 
must lie in the invariable plane, if c 0, and therefore it is planar. On the 
other hand, if c = 0, already we know that a flat solution is planar. Therefore, 
in all cases a collinear solution is planar. 

Rectilinear Solutions 

A solution will be called rectilinear when the straight line A* on which all 
the points lie (see the preceding case) stays fixed with time with respect to 
the inertial bary centric system. In such a case, however, the solution cannot 
exist for all times — cx) < t < + co, without there being a collision at a finite 
instant t = t\ between at least two of the N bodies. In fact, if we choose the 
X axis of the reference system to coincide with A* and consider the masses 
mi, m 2 , . . . , ruAT arranged from the left to the right, the A'-th mass will have 
a coordinate > 0, the barycentre being at the origin. On the other hand, 
as tun is attracted towards the left by the remaining TV — 1 masses, Xiv < 0 
always. If we now consider the function (T), it is clear that neither x^v > 0 
nor Xiv<0for— oo<t<+oo can hold; therefore either the solution exists 
only for a limited interval of time or at least one of the denominators of the 
potential vanishes at a finite time t = ti, and then the equations of motion 
are no longer defined. 

Homographic Solutions 

A homographic solution Vk ~ Vk {t) of system (3.1a,b) is characterized by 
imposing the condition that the configuration formed by the TV bodies at the 
instant t with respect to the inertial barycentric system remains similar to 
itself as t varies. This implies the existence of a scalar function X = \{t) and 
of an orthogonal 3x3 matrix i7 = 17(T) such that, for any value of k and t, 

rk{t) = A(t) I7(t) rl + r. (3.76) 

Here A (t) represents a dilatation and 17 (t) a rotation; the vector r which 
represents a translation must be considered identically zero because of the 
condition 

N 

^ mkVk = 0 
k=i 

for the coordinates of the barycentre. Finally represents the value of 
Vk {k = 1,2, ...,TV) at an instant T = to conventionally assumed as the 
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initial instant. Therefore it must also be the case that A° = A (to) = 1 and 
17° = 17(to) = 1. 

Clearly, the homographic solutions represent a rather restricted class of 
solutions of the A^-body problem, since system (3.1a,b), which is of order 
6 N, must be satisfied by 1 + 3 scalar functions and by the 3 N integration 
constants represented by A(t), 17(t), respectively. In homographic motion, 
the relative positions of the bodies are independent of the time; that is, their 
radii vectors have, at any instant, a constant ratio. If a homographic motion 
is such that f2{t) = 1, Vt > 0, and then rk{t) = that is, if the 

motion consists of a pure dilatation or contraction, the solution is said to be 
homothetic; if, on the other hand, the motion is such that \{t) = 1, Vt > 0, 
then it consists of a pure rotation and the solution is said to be a relative 
equilibrium solution: the points are at rest with respect to a reference system 
rotating around the bary centre. 

Let us now determine the fundamental equations of homographic motion; 
to do this, let us introduce, in addition to the inertial barycentric system 
whose axes have been called (x,y,z), a second system (^, ry, C), whose axes 
can expand or shrink, rotating around the origin of the former. The vector 
pertaining to the A:-th particle at the generic instant t is given by = 
{^kiVk: Ck)] at the instant t = to ^ we have 

rkito) =rl = Qkito) = qI- 

For a generic t, instead we have 

Qj,{t)^\{t)rl, A: = 1,2,..., TV. (3.77) 

If we call (jj = the angular velocity of the system with 

respect to the inertial barycentric one (x, 2:), we have that: 

a) for A = const. > 0 and u — const. 7^ 0, the motion is of relative 
equilibrium; 

b) for A = A(t) >0 and a; = 0, the motion is homothetic. 

For the moment of inertia, 

7 ° = ^ rrifc \rl\^ = Y^mk [{xlf + {ylf + {zlf] 

k=l k=l 

at the initial instant to- Instead, in the system (^,r/, ^), 

I^Y^ruk \Qk\^ ^'^mk + vl + cl) , 

k=l k=l 

the moment of inertia at the generic instant t. Prom (3.77), one sees imme- 
diately that 



/ = r. 



(3.78) 
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The components of the velocity of the k-th mass point, in the system (^, ry, C), 
will 

^ [(Cfci Vk^ Cfc)] Vk^ C/c) ^ (^i5 ^ 3 )] 7 {3.79a) 



that is, 



v^k = ^ - ivk - Cfc W 2 ) = ^ - A (yl Ws - zl 0 J 2 ) , 

Vnk = ^ - (Cfe oji - Cfe W 3 ) = 2 /fc ^ - A (z^ - xl W 3 ) . 

= ^ - {^k uj2-r]kOJi)^ zl^-\ {xl U2 -vloJi) 



and the kinetic energy is 



(x|fc + + vik) . 



fc=i 



(3.79b) 



If we call Qjk the distance between any two masses rrij and m/^, then 

Q% = fe- - ^kf + ivj - Vkf + (Ci - Ckf , {j,k = 1 , 2 ,..., N) 
and, from (3.77), Qjk = Therefore, for the potentials. 



u = -u^, 



since 



t'- E 



rrij rrik 
Qjk 



t'”- E 



rrij rrik 



(3.80) 



l<j<k<N <k<N jk 

with G = 1. Let us now consider Lagrange’s equations: 

dt[dek) dg, - ’ 

where C — T+U . By substituting and performing the calculations, we obtain 

d 



{ul+wl) 



-Vi 



}? UJ1U2 — (A^ CJ3) 






d 



}? (Ji (J3 + — (A^ LO2) 



1 -^k 

2^ rrik 



Xl - Xl 



A 



ki^j 






(3.81a) 



T. Whittaker: op. cit., p. 17 
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cl; 2 T 37 (A^ Ct^a) 
at 



A (wg + Wi) 






A o;2 (^3 



dt 



(A^ ^i) 



1 



J^rrifc 



k=jij 



Vk - Vj 

{r%f 



(3.81b) 



A^ LOiLU^ — — (A^ LU2) 
at 






A^ + 37 CUi) 

at 






A-^ - A^ (wi +w|) 



1 - Z° 



(3.81c) 



Equations (3.81) are the equations of the homographic motion for the j- 
th mass point. Now we shall study them, trying to draw out all possible 
information, in the three cases of: 

a) collinear homographic motions, 

b) flat homographic motions, 

c) spatial homographic motions, for which N > 4. 



a) Collinear Homographic Motions 



If all the N points are on a straight line, which we take as the ^ axis, then 
Vk~^k~ initial instant t — t^^ and also r]k = Ck = 0, V/c, 

at any subsequent instant of time. In this case, it is possible to choose the r] 
axis in such a way that <^2 = 0- Then (3.81) become 



d^A . 2_ 1 1 
|(A^c.3)=0, 









A^ uji uj^ — 0. 



(3.82) 



From the last two equations one has that either (Ja = 0 and lui ^ 0, or 
c <;3 ^ 0 and uj\ = 0. In the first case, since o ;2 = c^a = 0, the ^ axis is 
fixed and the motion then occurs on a straight line fixed in space; in fact, 
the total angular momentum of the system vanishes, as can be seen since 
= 0 and Vjjk = ~ motion is homothetic. In the second 

case A^ ljs — const. = a, and the ^ axis will no longer be fixed in space but 
rotate around the ( axis, which is now fixed. The motion will take place in 
the plane C == 0 therefore it will be a planar motion, the plane being 
fixed in space. Since the first of equations (3.82) does not contain cui, we can 
deal with both cases by substituting for cvs the expression a/A^, meaning 
that the first case corresponds to o; = 0. Then 



A 

dt“^ A^ 



1 1 
A2 X° 






XI - X, 



(r°kr 



(3.83) 
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which, since A 7 ^ 0, we can rewrite in the form 



d‘^X 

dt‘^ 



T 



1 ^ Xj^ Xj 



where it is evident that, the right-hand side being constant, the left-hand 
side will be as well. Let us call this constant — /3^, the negative sign signifying 
that the force is attractive. We can then write, in place of (3.82), 



d^X __ ^ 



\2 

A CJ 3 = a. 



(3.84) 



By multiplying both sides of the first equation by dX/dt and integrating, we 
obtain 



2\dt A 2A2 



with 7 an integration constant. By substituting the second equation here, we 
finally get 



1 fdA 

2 



^ \2 2 

2 ^ = 



which, if we take into account (3.77) and compare with (2.9b), looks like the 
energy integral for Keplerian motion. Hence (3.84) represent the equations 
of Keplerian motion. For constant A, oos will also be constant and the Keple- 
rian orbits will be circular; then there will be a collinear solution of relative 
equilibrium: the N points stay fixed on a straight line which rotates with 
angular velocity ujs around the barycentre. For a = 0 { 0 J 3 = 0), the line will 
stay fixed (rectilinear solution) and the motion will be homothetic; we have 
already seen that in the case of a rectilinear solution the motion ends in a 
collision. 

Going back to (3.83) and setting the right-hand side equal to — /3^/A^, 
we obtain 






rrik 






P^x°=0, j,k = l,2,...,N. 



(3.85) 



The quantities Xj in (3.85) depend only on the masses and on the initial 
coordinates, and so (3.85) represent N conditions which they must satisfy 
for the motion to be homographic. Since rrij Xj = 0 also holds for the N 
quantities Xj^ there are actually only N — I independent relations. 

For N — 2^ from (3.85) we have 



m 2 



Xn — XT 



+ /3 xJ — 0, 



mi 






0^ X 2 — 0 , 



which, reexpressed in the variables give the circular Keplerian motion 
around the barycentre of the two bodies with angular velocity = /3 A“^/^. 
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For N — 3, from (3.85), assuming, for instance, xj < X 2 < Xg, we can 
obtain an equation of fifth degree whose unknown is the ratio 6 = r 12/^23 
between two of the three mutual distances of the bodies. The solution turns 
out to be unique for 0 < (5 < 1, and this solution is just the collinear solution 
found by Euler in 1767. We have not carried out the calculations explicitly, 
since we shall obtain the fifth-degree equation in another way in Sect. 4.1. As 
there are three possible ways in which to arrange the three aligned bodies, 
there are just three possible collinear solutions in the case of three bodies. 

We can summarize what we have learnt about collinear homographic mo- 
tions by referring to (3.85). The relative positions of the N bodies at the 
initial instant are the same for both a = 0 and a ^ 0. For a = 0, the total 
angular momentum vanishes and the motion will be homothetic on the ^ = x 
axis fixed in space. In the second case, a ^ 0^ the purely homothetic motion 
is not possible: one has planar motion and the straight line on which the N 
bodies are placed rotates around the bary centre; in the case of constant A, 
the N bodies will rotate all with the same constant angular velocity, and 
therefore there will be a solution of relative equilibrium. We have seen that 
for N = 3 the possible solutions are 3!/2 = 3; in the case TV = 2, there is 
only one possibility, that is, 2!/2. Moulton^^ demonstrated that, if Up is the 
number of the possible solutions for TV = p, then {p 1) rip is the number 
of solutions for TV = p -h 1; by induction, the number of solutions for any TV 
is then given by TV!/2 (Moulton’s theorem). 



b) Flat Homographic Motions 



Let us assume as the initial plane of the motion the C = 0 plane; then 
= 0,\/k and also (k = 0, V/c, at any subsequent instant. Equations (3.81) 
become 



A (wl + w|) 



X‘^LOiU>2 ~ 



A^wiWs - — (A^o) 2) 
at 



+ yj 
+yj 



LOiU>2 — 



X-J^ - A^ (o;| +wj) 















rrik 






A^o;2^3 ^ 1 ) 

at 



= 0 . 



Vk - Vj 
(3.86) 



From the third of these 



A^ U>IU)3- (A^ (J 2 ) _ X 2 _ x% 

A2w 2W3 + ^ (A^Wl) y°i V 2 Vn' 



®°F. R. Moulton: The straight line solutions of the problem of 7V-bodies, Ann. Math. 
II Ser. 12, 1-17 (1910). 
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from which one concludes that the N bodies lie on the same straight line, 
and the motion is collinear homographic. We exclude this case, having already 
dealt with it; consequently, it must be the case that 

(jji — — (A^ LU 2 ) — O5 A^ ^2 ^3 + ^ <^i) = 0- (3.87) 

By multiplying the first of equations (3.86) by A^ uj 2 and the second by A^ ijj\ 
and subtracting them, we get 

^ ^ (A^ ^2) ^ ~ 

which, integrated, gives 



A^ {loI Lol) = const. 



(3.88) 



Furthermore, inspection of the first two equations shows that the quantities 






cPX 

dfi 



A^ (a;^ + o;|), A^o;ia;2, 



are all of the form const, x A“^. By subtracting, we also find that A^ {(J 2 —^i) 
and A^ {uo\ + UJ 2 ) are of the same form. Therefore, by multiplying the last 
quantity by A , we get 

A^ {(Jl + LO 2 ) = const. (3.89) 



Since (3.88) and (3.89) must hold simultaneously, there are two possibilities: 
o;i = u;2 = 0, or A = constant. In the latter case, also uj\ -\- uj 2 ^ ^2 + 
cjg, (jj\ +u;3 are all constant and, from (3.87), uoi (^3 = u;2^3 = 0 as well. As 
a consequence, either cji = u;2 = 0 (which comes into the first case, not yet 
examined), or uo^ — 0. If — 0, the motion is a rotation about a line of the 
C = 0 plane; we can assume this line as the ^ axis, and then o;2 = 0 also. 
Equation (3.86) then reduce to 



k¥=j 



rrik 



- Xj 



2 o 1 Vk Vj 

k^j ^^jk) 



From the first of these, 



Y1 



^ rrik 



'I ^ 



1 

2 



E 

l<j<i<N 



mjmi{x° — X 



— 



iJ 






0 , 



but this is absurd, since it is a sum of terms which are all positive. Therefore, 
it must be that ous 7^ 0, and, at the same time, ui = 002 = 0; the total 
angular momentum will be different from zero and the motion will not be 
homothetic. The C == 0 plane will rotate around the ( axis remaining fixed 
in space, and therefore the motion will be planar. In the case of constant 
(jJs (A = const.) the solution will be of relative equilibrium. 




226 3. The A/'-Body Problem 



Summarizing, we conclude that a flat homographic motion is also planar 
and cannot be homothetic. Once we have ascertained that the motion takes 
place in a flxed plane, to study the solutions it is convenient to take this plane 
as a complex one and to represent the point masses rrik (k — 1,2,. ..,A^) 
in the complex plane. To signify that the geometric flgures formed by these 
points must remain similar as time passes, we shall represent our points by 
means of 

Xk-^iyk = Zk = akq{t) (A: = 1,2, . . . ,iV), (3.90) 

where ak are complex constants, in general different, and q{t) a complex 
function of the real variable t (the symbol Zk has a meaning different from 
before). Now 

rki = \zk - zi\ = \ak - ai\ \q\. 

The equations of motion (3.1a) become, in the planar case and in complex 
form, 

Zk = G^mi - 3 — ■ {kj = (3.91) 

i^k 

By substituting, for g 0 , one has 



dkq 



l^k 



di - ak 
\ai - UfcP’ 



and, since not all the constants ak vanish, one also obtains qq^!^ = b 

(a constant not depending on t), because \q\ = {qq^^^^ In fact. 



dkq 



q\q\ 

l^k 



ai - Qfc 
\ai - ak\ 



= qq ‘^l^q G ^ mi 

li^k 



ai — dk 

\di dk I 



and then, multiplying this by , one obtains on the right-hand side a 

quantity that does not depend on A, as we have said. Therefore, the problem 
is turned back into that of the solution of the differential equation 

^•^1/2 ^3/2 ^ ^ j^3 g2) 

(which is independent of N) coupled with the algebraic system 



akb = CY, 

l^k 



mi 



di dk 
\di - dk\ 



(A:-l,2,...,iV). 



(3.93) 



Since (3.92) is the same for any it is clear that we can consider its solution 
known, since the solution of the two-body problem is known. We shall make 
use of this to get information about the constant b. 

For N = 2, mi ai -h m 2 d 2 = 0, which can be reexpressed by putting 
di = m 2 d and d 2 = —mid (where a is a not vanishing complex number). From 
system (3.93) we immediately obtain b = —(mi -f m 2 )~^ Therefore b 
is a real and negative quantity. Then, if we consider the function q = q{t), 
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the solution of the differential equation (3.92), to be a known function with 
b <0 and ay^O, x -\- iy z = aq{t) will be the parametric representation 
on the X y plane of a conic having one of the foci at the barycentre. 

If we come back to the A/'-body problem and suppose that ai, U 2 , . . • , 
be an A/^-tuple of complex numbers satisfying the algebraic system with b < 
0, then any solution of (3.92) coupled to it will provide a solution of the 
equation of the motion such that every point will follow a conic and in the 
course of time the polygons formed by the N material points will remain 
similar to themselves. In the particular case in which the conic is an ellipse, 
the function q{t) will be a periodic function so that there will be a periodic 
planar homographic solution. It is easy to check that the case of circular 
orbits corresponds to q ^ where Cc; is the angular velocity. In such a 
case we obtain 

We will again use this argument when we study in detail the equilateral 
solution for N = 3. 



c) Homographic Spatial Motions 



Let us now assume that the N bodies do not all lie, at a given instant, in the 
same plane. Taking the C axis as the instantaneous rotation axis, we then 
get (jJi = lj 2 = ^ and ujs ^ 0. The equations of motion (3.81) become 






2 ,2 

’3 



o ^ /\2 \ 1 









d^2 



A Lu‘^ 



1 Vk- Vj 



(3.94) 






^ ^ dt^ 



1 v; — ^ 



zY - z. 



k^j 






Multiplying the first of (3.94) by -mj y° and the second by rrij and 
summing them, we get 

ra, [(x”)^ + (!,;)^] I = ( E ^ WIl'A “ '4> ~ v" (4 “ ^J)]. 



and, summing over j, 

N 

m ■ \ ( ' . , , , , . . . 

- 3 ) J 



N 



E [(»■;)" + s = I E E ^ 






j = l k^j ^ 3 k 



from which 



A^ (J 3 = const. = a. 



(3.95) 
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Substituting in the first two of (3.94), we obtain 



X X 9 9 11 V — ^ 



dt^ 



A 



rrik 









~—y 

A ^ 



rrik 



k^j 



Vk - Vj 



const. 

A 



From the third of (3.94), we also have 



cPX _1 1 ^ zl- 



Then, finally, A^cjI = const, x A that is, 

A^ uJ^ = const. 



(3.96) 



Comparing (3.95) with (3.96), we see that there are two possibilities: either 
A = const, with cjs 0 or A arbitrary with c^s = 0. In the first circumstance, 
from the third of (3.94), we have 



k^j 




= 0 , 



which, multiplied by ruj Zj and summed over j, gives 



N 

Y1 

3 = 1 k^j 




1 

2 



E 

l<j<k<N 



rrij 



{z° -z°)^ 



= 0, 



which is absurd, being a sum of positive terms. As a consequence A cannot be 
constant with los 0. It therefore remains the case that us = 0, for which, 
from (3.94), we have 

A^ ^ const. = -/3^, (3.97) 

where, as usual, we insert the minus sign to signify that the force is attractive. 
From (3.97) we deduce that every material point moves on a straight line, 
fixed in space and passing through the barycentre, and follows a Keplerian 
motion. As uji = uj 2 = = 0, the total angular momentum of the system 

vanishes and the motion is homothetic. 



Conclusions 

From what we have seen in (a), (b), (c) we can conclude that every ho- 
mographic motion is flat (in particular it can be collinear), if the angular 
momentum c is different from zero; or homothetic, if the angular momentum 
vanishes; or, finally, both flat and homothetic (as happens in the case of the 
collinear homographic solution with a fixed straight line). A homographic 
motion with vanishing angular momentum is a homothetic motion in which 
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every body moves on a straight line passing through the barycentre of the 
system and stands on one of the vertices of a polyhedron which always re- 
mains similar to itself in the course of time and always takes an analogous 
position with respect to the barycentre. 
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What we have got up to now is the classification of all the possible homo- 
graphic solutions; however, the question of the existence of such solutions 
remains open. From what we have seen, if a homographic solution exists, it 
is determined on one side by the function A(t) and the matrix 17(t), and 
on the other side by the N vectors representing the initial positions. Our 
concern here is to find the necessary and sufficient condition for the existence 
of a homographic solution. 

The sufficient condition was formulated in rigorous terms by Laplace^^ 
and is known as Laplace^s theorem: 

If the initial configuration of the N bodies is a central configuration, then 
the motion is homographic. 

As we will see, this condition is necessary too. Let us first specify what we 
mean by central configuration: at a given instant t = to, the Wbody system 
gives rise to a central configuration if the gravitational force acting on every 
mass point ruk is proportional to ruk itself and to its position vector 
referred to the centre of mass. This means that 

dU 

— =amkrk, V k = 1,2, . . . , N, (3.98) 

OTk 

with the scalar a not depending on /c. It is clear that, if cr remains constant 
in a solution, the motion is that of an oscillator. The quantity a can be found 
in the following way. Multiplying (3.98) by and summing over k we get 



dU 



^ 



that is, —U = cr from which 



dU 

drk 



U 

jruk r/e. 



(3.99) 



where we have made use of Euler’s theorem and the definition of the moment 
of inertia. Moreover, since dl/dvk = 2 (3.99) can be written, if we 

multiply by I [/, in the form 



S. de Laplace: Traite de Mecanique Celeste^ Tome quatrieme, Livre X (Paris, 
1805), pp. 307-313. 
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2IU^ + U^-^^0, 

OVk OVk 



or even 

r\ 

t^(/C/2)=0. (3.100) 

It follows that the central configurations are specified by the critical points 
of the function I . 

Before proving Laplace’s theorem, we must establish some relations that 
will be useful for the proof. If we indicate by 



Xk\ 

Tk= \Vk \ 

\zk) 

the position vector of the k-th material point in the inertial barycentric frame, 
we have 

/ 6 \ f Xk\ 

Qk= I % 1 = f'k = I 2/fc 1 (3.101) 

for the position vector of the same point in the rotating frame. The matrix 
Q = n (t) is the orthogonal matrix (with determinant +1) introduced in the 
previous section; we assume that both Q and the vectors Vk and are 
functions of t. As f2 is an orthogonal matrix. 



that is, 

nff = n^n = \. ( 3 . 102 ) 

Differentiating, we get 
from which 

n + nF' ti = 0, 
if ^ 17 = 0 , 

fi = iif . 

Therefore 17 is an antisymmetric matrix, which we shall denote by 17, 
and, introducing a vector^^ c^?, we can write 



uj = 



CJi 

UJ2 

CJ3 



E = fi~^i2 = 



0 -cja UJ2 

uj^ 0 — (jJi 

—LU2 UJi 0 



= -E^ 



(3.103) 



^^See A.Wintner: op. cit., p. 50. 
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Differentiating, we get 



n) = n + n 

= i7 + {f2 sf ti = = n - 

where = S S is the matrix of elements {ojiUk — hk) and = 
Wj + Then 

h = t: + . (3.104) 

It can be also verified that 



I!Qk=u}XQk (3.105) 

EQf.=ui'XQk (3.106) 

Qk = {^ ■ Qk) ^ Qk- (3.107) 

Multipying (3.101) by 17 and differentiating, we have ffc = fi Qk + fi Qk, 
from which 

17 ^ Tk — O ^ fi Qk 8k — 6k ^ 6k’’ (3.108) 



where we have used (3.103) with (3.105). Comparing (3.108) with (3.79a), 
we promptly see that the vector w is nothing but the angular velocity of the 
rotating frame with respect to the barycentric inertial frame. The left-hand 
side of (3.108) represents, in fact, the velocity of the fc-th point referred to the 
rotating frame, obtained by operating with the matrix 17“^ on the vector 
Tk (velocity referred to the inertial frame). A further differentiation of rk, 
with the use of (3.105-107), provides the acceleration 

17“^ fk = Qk + ^‘^>^h+‘^^6k + i<^-6k)^-^^ek- (3.109) 



In the case of a planar solution, once the plane C = 0 has been chosen as the 
plane of the solution, it can be made to coincide with the z — 0 plane of the 
inertial frame, and therefore the matrix 17 = 17(t) can be expressed in the 
form 

/ cos 4> — sin (f> 0 \ 

17= j siiicp cos<f> 0 j , (3.110) 

\ 0 0 1 / 



4> being the rotation angle. Using (3.103), we obtain for the matrix S 



E = 



0-0 0 

0 0 0 

0 0 0 



0 — W3 0 

W3 0 0 

0 0 0 



(3.111) 



r 

n;= 0 

u 



since, in this case. 



(3.112) 
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From (3.111) and (3.112), we obtain in this case the expressions for the 
velocity (3.108) and acceleration (3.109): 

f ik\ / 0 \ f ^k\ fik-Vk<P\ 

+ l^j ^ (3.113) 

«-n-(t)+2(o)x(|).(o)x(|) 




( Cfe - 2 t)a:<A - ^k^^ - 

Vk + ‘2^k^ - Vk^'^ + j ■ (3.114) 



Returning to the general case, we can write (3.77) in the form Qj^ = 
Xrl = Alr^, where, as usual, 1 is the unit matrix. Differentiating, yields 
Qj^ = XI rl and = A 1 r^. In place of (3.108) and (3.109), we have 



fl~^Vk 



(Xl + XE) rl 

\X1 + 2XS + X{S + S^) 



(3.115) 

(3.116) 



Moreover, from the definition of potential, 






du 

drk 



1 dU° 1 
A2 dvk ~ A2 



(3.117) 



where by , we refer to the value of the acceleration at the time to, so that 
it is a constant vector. Therefore, if r^(t) is a homographic solution, we have 



.. dU _ 
rrik Vk = -X — = rrik U 
dvk 



Xl + 2XS + X{ij + S^) 



_ ^ 1 _ _ 1 
“^A2 drk -^a2 



r 



O 

k 



from which. 



with 



al = K{t)rl, 



K{t) = X^ 



Xl + 2XS + X{iJ + E^) . 



(3.118) 

(3.119) 



In the case of a planar solution, with uji = L 02 — 0 and cjs = 0, the kinetic 
energy and the angular momentum will be 
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2 '' 2 ’ (3.120) 

c = x'^u)3r = x^^r. 

From the first of (3.120) and from (3.80), the energy integral will be 

r -u = \{x^ + x^ ^'^)r ^h. (3.121) 

2 A 

The matrix K{t), in turn, will be: 

/A^(A-A(^2) _a2(A(^ + 2A0) 0 \ 

K(t)= A2(A0 + 2A0) A2 (A-A02 ) q ^ (3^122) 

\ 0 0 A^A/ 

In the general, not planar, case of homographic motions, we saw in Sect. 3.7 
that the only possible motions are those corresponding to ui = 0 J 2 = ^ 

(and c = 0) so that equations (3.120-122) are still valid if we assume = 
0, c = 0. The Lagrange- Jacobi identity will also continue to hold in the form 

o tjo 

7 = 2C + 4/1 = ^(A2 7°) = + 4/i, 

dt^ A 

from which 

7 ° (A^ + A A) - ^ = 2 /i. (3.123) 

A 

We can now prove that a necessary condition for the existence of homo- 
graphic motions is that the configuration be central, namely that, if homo- 
graphic solutions exist for the Wbody problem, then the configuration is 
central. Let us introduce the following constants: 



u° 

Too = 


(3.124) 


II 

0 


(3.125) 


0 

II 


(3.126) 


By substituting (3.124) and (3.125) into (3.121), we obtain 




{X^ + X^-^) ^7° =2/^0. 

A 


(3.127) 


From (3.126) and the second of equations (3.120), 




(M 

II 

0 


(3.128) 


Finally, putting (3.124) and (3.125) into (3.123) yields 




( a 2 + aa )-^ = 2/io. 

A 


(3.129) 
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Subsequently, from (3.127) and (3.129), we get 

A-A,^2^^ = 0, 

and, differentiating (3.128), 2 A A 0+A^ 0 = 0. As a consequence, substituting 
into K{t)^ we shall obtain 

/ —rrio 0 0 

K{t) = I 0 —mo 0 

\ 0 0 —mo + A^ 

Since, by (3.118), we have 

it follows that, both in the case of planar homographic motion, with = 0 
and dU° Idzk = 0, and in the general (0 = 0) case, condition (3.98) is 
satisfied for a = —rrio = —U°/P at the initial instant t = to- As the choice 
of to is arbitrary, the necessity of the condition is proved. 

We can finally prove Laplace’s theorem, that is, that condition (3.98) is 
sufficient, besides being necessary for the existence of homographic motions. 
We have already observed that the definition of homographic motion can be 
written, both in the planar and in the general case, in the form 

Tk =\{t)fl{t)rl, 

with the matrix 17(t) given by (3.110) and setting ^ = 0 in the general case. 
Therefore every homographic solution turns out to be determined by the N 
initial position vectors and by the two functions \{t) and 0(t), which 
we will assume to be and subject to the constraints 

Ao = A(to) = 1, <Po = 0(^o) = 0, 00 = 0(^o) ^ 0. 

To prove Laplace’s theorem it is therefore sufficient to prove that every solu- 
tion Vk = X rl, for which the consequences of (3.127-129) can be assumed 
to be valid, satisfying the condition (3.98) is also a solution of the Wbody 
problem. In fact, it is clear that, if the vectors rk are of the form A 17 r^, they 
either represent homographic solutions or are not solutions. By hypothesis, 

dU^ 

- — = -moTrikrj^. 

dvk 




From (3.117), the equations of the homographic motion can be written in the 



form 




_L ^ 

A2 dvk ' 



Inserting into this the relations implicit in the hypothesis, we have rrikrk = 
— A“^ 17 rrio rrik so that: 



A^ 17 ^Vk = -rrio r% 



(3.130) 
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For our thesis be proved, we need (3.130) to be an identity in while at 
the same time (3.127), (3.128) and (3.129) hold. Whit this goal in mind, let 
\{t) and 0(t) be two given functions of class C^, f2{t) a matrix function 
of <p according to (3.110), and finally K{t) a matrix given by (3.122). Dif- 
ferentiating and using the multiplication rules among matrices, we then find 
that 

^{Xf2) = K. (3.131) 

On the other hand, differentiating the homographic solution twice gives 

which, by virtue of (3.131), provides the identity (in t) Vk = 

Comparing this with (3.130), we have only to prove that 

Krl = -morl (3.132) 

is an identity in t. But this has already been proved: in fact, we found that, 
both in the planar and in the general case, K = -rrij^ ^ {dV/drk) holds, 
so that, since now by hypothesis dU^fdvk — — mom/e r^, (3.132) follows 
straightforwardly. 




Chapter 4 

The Three-Body Problem 



Historically, the three-body problem is the most important problem of celes- 
tial mechanics and for about two centuries has also been the most extensively 
studied problem of the whole mathematical physics. In this chapter, we con- 
sider only the “classical” subjects, while the more “modern” ones (KAM 
theory, chaotic solutions, etc.) find their place in the Volume 2. Of course, we 
have tried to fashion the treatment of the subject in such a way that it seems 
quite natural to insert subsequent developments. It is very hard, indeed im- 
possible, in so little room, to succeed in introducing all the main problems; 
therefore, we have made a choice, consistent with the general intention of 
the book. For the rest, we have endeavoured to provide the reader with the 
tools and the information necessary for continuing to study the subject in 
specialized books or original papers. 
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4.1 The General Three-Body Problem 

As Wintner already warned/ . . every new generation is usually compelled 
to reinterpret what the “problem” of three bodies actually is” . As a matter of 
fact, the three-body problem, for more than two centuries the most famous 
(unsolved) problem of mathematical physics, has spurred on the study and 
the progress of several branches of mathematics and, as a consequence, it 
has been examined from different, yet complementary viewpoints. Wintner ’s 
statement remains valid, as we shall now try to summarize. 

The system of differential equations for the three-body problem (see be- 
low: (4.1)) cannot be reduced to quadratures, and hence the problem is not 
solved; on the other hand, one can find a convergent series expansion along the 
whole of the time axis for the solution and consequently state that the prob- 
lem is “solved” . The first assertion (negative) has to be considered inadequate 
and the second one (positive) meaningless. The appropriate way, nowadays, 
to face the problem is to study the motion of the representative point on 
the manifold defined by the integrals of energy and angular momentum (the 
time having not been eliminated, this manifold is seven dimensional). In this 
section we shall give an account about the information which can be obtained 
from the set of equations, and also evaluate its homographic solutions. In the 
next section we shall present the main ideas of Sundman’s method to obtain 
the above-mentioned series expansion (regularization) and we shall briefly 
explain the subsequent treatment by Levi-Civita of the same problem. The 
study of the flow on the manifold of energy and angular momentum and the 
treatment of periodic orbits will be dealt with in Volume 2. 



The System 



For V = 3 , the set of equations (3.1a) becomes 



Gm2Tni , . Gmsirii . . 

Gmsm2 , G mi 1712 , 

m2V2= , ,3- (r3-r2)+ (n - r2) , 



ms rs = 



{rs2r 

Gm\ ms 



Gm2m3 

(ri - rs) + — — ^ r2 - r^) . 



It is a system of order 18 and, as we have already said, may be reduced, 
by means of the 10 first integrals and the two subsequent reductions, by 12 
orders. In general, there remains, therefore, a system of order 6: the solution 
of such a system is what has constituted, historically, the three-body problem. 
The actual reduction of the system to the sixth order has very little interest, 
since we do not know how to solve the reduced system. It is interesting. 



^ A. Wintner, op. cit., pp. 345-346. 
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on the other hand, to study the particular solutions which we mentioned in 
Sects. 3.7 and 3.8, that is, the homographic solutions. In our case, they can 
be either collinear or planar, i.e. the Lagrangian solutions recalled above. 



The Lagrangian Solutions 



Let us start from the collinear ca.se . The conditions that the initial coordinates 
should satisfy, for instance at the time t = 0 , to have a homographic motion 
are given by (3.85), which become, for N — 3, 



m2 

ms 

mi 



{x°2 - 

{rl2? 

(rhr 

(^31 



+ rri 3 



+ mi 



+ TO 2 



(a^3 -x°i) 

{rhr 

{.A - X°2) 
{Aif 

(A - A) 
{^2^ 



+ Xj — 0, 

+ rc2 = Oi 
+ /3^ xl = 0 . 



(4.2) 



Of the three equations (4.2), only two will be independent, since the coordi- 
nates refer to the inertial barycentric frame. Without loss of generality, we 
shall put < X 2 < x^ so that ri 2 — '^21 = ^2 ~ ^15 ^13 == '^31 — ^3 “ ^1 

and ^23 — '^32 = X 3 — X 2 - By subtracting now the second of equations (4.2) 
from the first one, we now obtain 



P^r°i^ = 



(mi + m2) 

(r? 2 )^ 



+ m 3 



(^23)^ 



and, by subtracting the third equation from the second, 



(A As — 

Furthermore, by defining 



(m 2 + m 3 ) 

{rhr 



+ mi 






(5 = 



Xl~X2 

Xi) and x^ — X 2 = {x^ — Xi)/6, we get 

<52 52 



> 0 , 



so that X 3 



Xi= (1 + 1/(5) {X2 



r^?2 = 



/?2 



(mi + m2) 

(rhr 

{mi + m 3 ) 52 



m 3 



+ mi 



(l+(5) (r°2)2 
(52 






1 



(1 + (5)' (r?2)2 {A 2 H 



(4.3) 



By eliminating rj 2 from (4.3), we obtain an equation which is of fifth degree 
in 6: 



{m 2 + m 3 ) (5®+ (2 m 2 + 3 m 3 ) 5“* + (m 2 + 3 m 3 ) (5^ 

— (3mi + m 2 ) (52 — (3mi + 2 m 2 ) d + (mi + m 2 ) = 0. 



(4.4) 
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If we write f{S) for the left-hand side, then 

lim f (S) = — {mi + m 2 ) , lim / ((5) = + cxd. 

(5^0 S^oo 

Hence, (4.4) will have at least one positive real root. Since the coefficients of 
the powers of S in (4.4) show only one change of sign, according to Descartes’s 
rule, we shall have one and only one positive root. Since there are three ways 
to arrange three points on a straight line, there are three possible collinear 
solutions. In the case where A is constant (see Sect. 3.7), the three bodies 
will all rotate at the same constant angular velocity, and then there will be a 
relative equilibrium solution (see Fig. 4.1). When A is not constant, on the 
other hand, the three bodies will describe conics having one focus in common. 
Let us move on to the planar case. 

We start from system (3.93), where the unknowns are the complex con- 
stants ak{k = 1,2,3), which, together with the function q{t), characterize 
the positions of mi, m 2 , m 3 in the complex plane. If at the time t = 0 we 
normalize the function q{t), i.e. we make the real part equal to 1 and the 
imaginary part equal to zero, then g( 0 ) = 1 , and in addition 



ai=Xi+i^i, a2=X2+i?/2, + 



where x^. = Xfc(O). System (3.93), supplemented by the condition that the 
coordinates are barycentric, will therefore give 



xlb = G 
xlb^G 
xlb^G 



m 2 



m 3 



mi 



{rl2? 

(rhr 

(x°i - xl) 

(^ 31 )^ 



+ m3 
+ mi 
+ m 2 



0 = mi a:i + m2 X2 + m3 X3 



y°,b = G 
y°b = G 
y°sb = G 



m 2 



m 3 



mi 



{y2-yi) 

ir°x2r 

(^ 3 -^/ 2 ) 

(^’ 23 )^ 

jyt-ya) 



+ m 3 



+ mi 



+ m 2 



0 = mi j/i + m2 2/2 + m3 2/3. 



(rtsr . 
K - AY 
{r°2,f . 

{X°2 -Xg) ' 

(^^2)^ J 

(yg -ViY 

. 

(y? -yg) ' 

(r- 2 °i)" . 
{y^-vsY 
{r°32r . 



(4.5) 



In (4.5), there are obviously only six independent equations, since the equa- 
tions for the coordinates of the barycentre can be obtained from the other six 
by multiplying them for the corresponding masses and by summing them. 

Let us now demonstrate that the only admissible configuration, besides 
the collinear one, is the configuration where the three masses are at the 
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corners of an equilateral triangle. In fact, we start by supposing that it will 
not be like that: for instance, ^ Since the orientation of the axes is 
completely arbitrary, we can fix it by letting the x axis pass through the mass 
point m3. Then = 0, and from the equation of the barycentre mi yj = 
—m2 2/2, which is consistent with 

^1 Vi ^2 V 2 ^ ^ 

(r-3°i)^ (r-3°2)^ 

only for — Q, if ^ Repeating the procedure, by making the 

X axis pass through one of the other two points, one immediately finds that 
the only solutions of the system are those corresponding to: 

1) »’?2 =^§3 = ^31- 

2) the three points on the same straight line. 




Fig. 4.1 



Fig. 4.2 



In conclusion, the only homographic solutions of the three-body problem 
are given by the Lagrangian solutions: the collinear solution and the equilat- 
eral solution. Writing tq for the side of the equilateral triangle at the time 
t = 0, from (4.5) we get 

o u ^ ry “^2 ^1 - m3 xj + (m2 X2 + m3 X3) _ G(mi + m2 + m3) xl 

^ ^ l3 l3 ’ 

^0 ^0 

rg 

If we put mi + m2 + m3 = M, then 



-b^ 



GM 



(4.6) 



it not being possible to have = 0, V fc. 

We have already said in Sect. 3.7 that in the case of circular orbits one has 
—b = a;^, where uu is the angular velocity common to the three bodies (ro- 
tating in the inertial barycentric system around the barycentre, see Fig. 4.2). 
Hence 
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In the general case (A not constant) the orbits will be given by conics; in the 
case of elliptic orbits (see Fig. 4.3) q{t) will be a periodic function. 




The Jacobi Coordinate System 



In the above-mentioned paper referring to the elimination of nodes, Jacobi 
introduced a particular system of coordinates so as to reduce the study of 
the iV-body problem to that of an equivalent problem of AT" — 1 bodies of a 
suitable mass. Furthermore, the final system of equations is still a canonical 
system. Let us examine it for the case N = 3 . First we refer the motion of 
rri2 to the body mi, by setting r = r2 — ri, and the motion of m3 to the 
barycentre of mi and m2. If we define /i = mi + m2, the coordinates of 
this barycentre are given by (mi ri + m2 v^)/ li or —m3 r-^j if we choose 
barycentric coordinates, and thus mi ri + m2 V2 + ru3 r3 = 0. If we denote 
by Q the vector distance of m3 from this barycentre, then 

m3 Tz (mi + m2 + m3) Tz M 

Q = rz-\- ^ ^ = — rz. 

mi + m2 mi + m2 /i 

In addition, one can easily verify that 

m2 mi 

rz - ri = Q-\ r, vz - V2 = Q r. 

fi II 



By dividing the first of equations (4.1) by mi, the second by m2, and by 
substituting one into the other, one gets 



Gii 



r Gmz 



'Q-{mi/n)r Q+{m2/fJ.)r 



{r2sr 



{my 



(4.8) 
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By next multiplying the third of the equations of motion by M \ 

one eventually obtains 

GMmi , , , , , GMnfi 2 . r / \ \ 

Q = 7 — ^ {q + {m 2 jj) r) — ^ {q - (mi//i) r) . (4.9) 

iJ-insr m(^23)'’ 

If we define the relative velocities v = r, V = g and the masses 



^1 + ^2 

we obtain in the new coordinates 



Ms = 



ms (mi + m 2 ) _ m3/i 
mi + m2 + m3 M ’ 



c = M 2 {r X v) Ms {g X V) ^ I — M 2 T^ ^MsQ^ T — —M 2 V^ -\- —MsV^ . 
If furthermore p — M 2 P = Ms V, and 



n = T-u 



1 1 Gmim2 Gm2ms Grrismi 



2 M 2 2 Ms r V 2 S 

we can obtain the Hamiltonian form of the previous system: 

. dn . dU . dn . d% 

’•=8?' = 



System (4.10), of order 12, is then a Hamiltonian system relative to two 
bodies with fictitious masses M 2 and M3. By exploiting in the usual way the 
integrals of energy and angular momentum and eliminating the time and the 
nodes one of course gets to order 6. The adoption of the Jacobi coordinates, 
then, does not imply any variation in the number of the reductions. However, 
as we shall see, it is the most straightforward way to go from the general 
problem to the restricted one. As we stated at the beginning, Jacobi’s method 
applies in general to the A'-body problem and consists mainly in the choice 
of two vectors: one given by the position vector of the barycentre, which will 
be assumed equal to zero by fixing the origin at the barycentre itself, the 
other given by the distance between two of the three bodies. 

Let us see how the method is applied in practice, finding system (4.10) 
once more. Since this is a canonical system, we may obviously take it as 
obtained by means of a canonical transformation performed on an initial sys- 
tem, equally canonical. Consider system (4.1), which can certainly be written 
in Hamiltonian form, assuming the components of ri, r2, rs as coordinates 
and the components of = miVi, P 2 = m 2 V 2 , Ps — msVs as conjugate 
momenta. Let us now look for a canonical transformation which gives as new 
coordinate vectors 



r = r2 -ri, g 



miri -\-rri 2 V 2 

rs , r^ 



mi ri + m 2 r2 + ms rs 

M 



where and M have their previously defined meanings. In so doing, we have 
not yet chosen the position of the origin, by not fixing it at the barycentre. 
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Assume that W 2 = W 2 (r, q, rc.m.; Pi, P 2 , P 3 ) is the generating function 
of the desired transformation. Hence 



dW 2 



dW 2 



dW 2 



yjVV2 KJYY2 UYY2 / A -i-i\ 

ri = ^ — , V 2 = ^ — , rs = -- — . (4.11) 

OPi dp2 5P3 

On the other hand, by means of a bit of algebra, one can also easily obtain 



m 3 m 2 



m 3 mi a 



From (4.11) and (4.12), one then has that W 2 is given by 



(4.12) 



rrr I m3 m 2 \ ( m3 TT^l \ / P \ 

w, = ( -r jp, + - -p+ jp,+ + M 

(4.13) 



The new conjugate momenta will be obtained by differentiating (4.13) with 
respect to r, g, rc.m.- The result is 

dW2 7712 mi mim2 .. . , , 

= Pi + — P2 = ; {r2 - ri) = M 2 [r2 - ri) = p; 

or IX li mi m 2 



dW 2 m 3 



dg 

dW 2 



M 



(/xr 3 - m 2 f *2 mivi) = M 3 V = P; 



dr 



= Pl +P 2 +P 3 =Pc.m.- 



If now, lastly, we put the origin of the coordinate system at the barycentre, 
i.e. by setting rc.m. = 0 and Pc.m. = O 5 we get the system described by 
the canonical vectors r, g, p, P and consequently have system (4.10) as the 
Hamiltonian equations of the motion. 



4.2 Existence of the Solution — 

Sundman and Levi-Civita Regularization 



As we have said several times, in the case where the conditions for the appli- 
cability of Cauchy’s theorem are satisfied, there will also be a solution which 
can be represented by a convergent series expansion. Since the radius of con- 
vergence of a series is determined by the position of the nearest singularity, 
it appears that the first difficulty that one faces, when trying to build a solu- 
tion given by a convergent series expansion, is given by the singularities due 
to binary collisions. In the case of the three-body problem this difficulty will 
also be the only one, provided that the case of zero angular momentum is not 
taken into consideration. In Sect. 3.3, we recalled the theorem which states 
that collisions of whatever type are not probable in Newtonian gravitational 
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systems; however, they do exist and, when they occur, they cause a decrease 
of the convergence radius of any series solution. 

Needless to say, we are trying to study the collisions in the mathematical 
model of three mass points subjected to their mutual Newtonian attraction. 
In physical reality, it would be meaningless to deal with the analytical con- 
tinuation of the solution after the collision, since the collision itself, doubtless 
inelastic, could substantially modify the problem. The problem is then ex- 
clusively the problem of guaranteeing the existence of the solution of system 
(4.1), for any value of t. 

The other feature of the problem of regularization, that is, that of the 
numerical calculations already mentioned in Sect. 2.6, does not have great 
importance in the case of the general three-body problem, since actual appli- 
cations in celestial mechanics deal almost entirely with the restricted problem, 
which we shall analyse in the following sections. The complete regularization 
of the three-body problem was obtained for the first time by Sundman in 
1907,^ by applying rather cumbersome procedures. 

Sandman’s approach is not direct; it requires the introduction of a rather great 
number of auxiliary variables, and clumsy calculations, in order to obtain a regu- 
larized system which can no longer be included in the framework of the dynamics; 
this certainly is a serious inconvenience, because it is no longer permissible (at least 
without preliminary discussion) to apply to such a system either theoretical results, 
or the methods of calculation of analytical mechanics.^ 

Levi-Civita’s criticism is addressed mainly to the consideration that in Sand- 
man’s theory it is not possible in the end to “read” the equations as dynam- 
ical equations, that is, equations which still keep their initial features, for 
instance, those of a Hamiltonian system. This property, as we shall see, will, 
however, be maintained in Levi-Civita regularization. 

We should add, to give the whole truth of the matter, that the route 
initiated by Sundman is still followed when looking for global solutions of the 
Wbody problem. Taking into account the complexity of Sundman’s method, 
we shall confine ourselves to dealing only with the one-dimensional problem, 
which, however, demonstrates of the method fully. We shall follow step by 
step the nice presentation given by Saari.^ 



^ See Footnote 28, Sect. 3.4. 

^ T. Levi-Civita: op. cit. in Footnote 20, Sect. 2.6, p. 12. 

^ See, for instance, W. Qiu-Dong: The global solution of the N-body problem. Ce- 
lestial Mech. and Dyn. Astron. 50, 73-78, (1991). 

^ D. G. Saari: A visit to the Newtonian A-body problem via elementary complex 
variables, Amer. Math. Monthly 97 , 105-119, (1990). 
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Sundman’s Method 

Consider a Newtonian force field in which the particles collide collinearly. The 
motion occurs along the positive x semiaxis and, leaving aside the constants, 
the equations of motion are x = — as the right-hand side is always 
negative, the solution x{t) has a downward concavity. This forces a collision 
to occur in the system, going backwards in time; furthermore, if x{t) < 0, 
then X will also remain as such in subsequent instants of time. Multiply 
both sides by x and integrate to obtain the energy integral: 

x'^ = - + 2h. (4.14) 

X 

If a collision occurs at a particular instant, which for the sake of simplicity 
we shall assume to be t\ = 0, then the energy integral becomes x {x)‘^ = 
2 + 2/ix ~ 2, that is, X ~ —2^/^. By integrating this, one gets x^^‘^{t) ~ 
— 3t/\/2, that is: 

/q\ 

x(t) = f - ) for t 0. (4.15) 

We saw in Sect. 3.3 that in the general problem of N bodies, a necessary 
and sufficient condition for there to be a collision at the time t — 0 is that 
the potential goes as The behaviour of (4.15) then corresponds to the 

general case. What the nature of the singularity is remains to be discovered. 
It is possible to demonstrate (here we shall do it only for particles colliding 
collinearly, but the result is true in general) that a binary collision always 
corresponds to an algebraic branching point. Let us put x{t) = X{t) and 
substitute in x = —x~‘^; this yields 

t^X + jtX-^X = -X_ (4.16a) 

To transform this equation into an equation which admits analytical solu- 
tions, we make a change of the independent variable: s = Then 

X" + 2sX' -2X = -X, (4.16b) 

where the prime indicates differentiation with respect to s. Using standard 
methods (series solutions and the method of majorizing to demonstrate the 
convergence) one can show that (4.16b) has a solution which is analytical in 
s. Furthermore, in a neighbourhood of the instant of collision the solution is 

x{t) = akt^ . (4.17) 

k 

From (4.17) it follows that a binary collision is an algebraic branching point; 
locally s = is suitable for regularizing the equation of motion. Equation 
(4.17) also suggests, at least from a heuristic point of view, an explanation of 
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how the change of independent variable operated by Sundman works. Unlike 
what we have supposed (binary collsion for t ^ ti)^ we do not know a priori 
when the collision will occur. It is known, however, that it occurs if and only 
if r tends to zero. Therefore, the growth properties of r enable us to establish 
the instant at which the collision will occur. Consequently, it is necessary to 
choose a suitable exponent a for r, just as can effectively replace the term 
{ti - in the change of independent variable. The asymptotic relation 
(4.15) shows that it must be o; = 1, so that a natural choice for the change of 
the independent variable is ds — dt/r{t). This is the very change operated by 
Sundman. In this way, the real singularities are eliminated; however, we must 
take into account that complex singularities may exist, a further cause of the 
decrease in the radius of convergence of the series. Indeed, imaginary collisions 
among particles for imaginary values of the time exist. To understand how 
these imaginary collisions are connected with the actual behaviour of the 
system, we shall consider the elliptical solutions of Kepler’s problem: 

r{t) - (4.18) 

The solution r{t) is implicitly defined by (2.50): 

r = a (1 — e cosu ) , 

with u a function of the independent variable t through Kepler’s equation 
(2.53). The elliptical solutions are well behaved and there is no possibility 
that collisions occur; one might then think that the series expansion converges 
for every value of the time t. This, on the contrary, does not happen, because 
there are complex singularities. Putting u = u\ -\- iu 2 and t = ti -\- it 2 ^ not 
only has it been calculated that an imaginary collision occurs when r — 0 
(for complex values of u and t), but it has also been found that the real part 
of t is determined exactly where r(t) reaches its minimum value on the real 
axis. Furthermore, the smaller the value of this minimum of r{t) (that is the 
greater is the value of the eccentricity e), the closer the complex singularity 
is to the real axis. This suggests that, if somehow the three bodies are forced 
to remain far from a triple collision, then perhaps the complex singularity 
remains far from the real axis. 

Sundman demonstrated that, if c 7 ^ 0, then for each value of t 
mdix {\ri (t) — Vj {t)\} > D{c) >0. 

This means that, not only do triple collisions not occur, but the system keeps 
itself far from such collisions. By applying the above argument and Cauchy’s 
existence theorem, Sundman showed that the system does not admit complex 
singularities in a strip of the complex plane (dependent on the value of c) 
containing the real axis. This strip is the analytical domain of the existence 
interval (that is, the real axis) of the solution which, as we have seen, is 
well behaved, since both the real collisions and the complex ones have been 
eliminated. Now, it is sufficient to find the conformal transformation able to 
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bring the strip obtained in this way into a unit disk. For instance, if the strip 
is defined by |Im(s)| < j 3 ^ the change of independent variable is 

(7= + 

In this new system, the motion equations do not have singularities in the unit 
disk, and so the series expansion converges. Along the real axis in the unit 
disk, a corresponds in a one to one fashion to all the real values of time. In 
this way, Sundman demonstrated the existence, for the three-body problem, 
of a convergent series expansion for every real value of time. However, the 
solution thus obtained is not of any practical use, because it converges too 
slowly and hence too many terms have to be taken into account to achieve 
an acceptable accuracy. Remember that two changes of independent variable 
have been performed, the first {s = having been used to eliminate 

the binary collisions. This results in a “slowing down” of the dynamics: the 
numerical value of s tends to be greater than the corresponding numerical 
value of t. By the second change, the greatest values of s are exponentially 
transferred into the unit disk of the complex plane a. Then small values of 
t may be identified with values of a close to the boundary of the unit disk. 
When this happens, the speed of convergence of the series is slowed down. 



Levi-Civita Regularization 

To introduce Levi-Civita regularization, it is convenient to consider first the 
case of planar motion. Let us assume then that the three points Pi, P2, P3 
always move in a plane, which we shall assume is the xy plane (see Fig. 4.4). 
Moreover, we shall set the origin of the coordinate system {x y) at the point 
Pi; our coordinate system will have a moving origin (with respect to the 
inertial barycentric system); however, we shall assume fixed directions for the 
coordinate axes. As coordinates of P2 and P3, we shall take the Cartesian 
coordinates X2, 2/2 ^ ^3 5 respectively; as conjugate momenta we shall take 
instead the components of the momenta with respect to the barycentre B 
of the three bodies. We shall call P2 and the respective vectors; the 
analogous momentum of Pi will consequently be given by — {p2 +^3)* The 
potential function will be given, as always, by 



mi m2 m2 m3 m3 mi 



with the symbols having their usual meanings. If we denote by 7i, ?2, Ts 
the kinetic energies of the three bodies, then 

= 2^ [{P 2 x ~^P 3 x)‘^ + {P 2 y -^PSy)‘^]^ 

and T = 7 i+ 72+73 is the total kinetic energy. 
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We shall 
Hamiltonian 
given by 



define as the Hamiltonian the function T-L = T — and the 
equations of motion of P 2 and P3 with respect to P\ will be 



dxi 


_ dn 


dpix _ 


dn 


dt 


dpix’ 


dt 


dxi 


dyi 


_ dn 


dpiy 


dn 


dt 


dpiy' 


dt 


dvi 



(4.20a) 



with the energy integral 



1-L — T—U — h — const. (4.21) 

From what we have learned from Sundman’s and Painleve’s theorems, we 
know that for c 7^ 0 and N = 3 the singularities that we may meet are 
exclusively due to binary collisions. Let us concentrate on the collision of P\ 
and P2, meaning that whatever conclusion is reached it can be repeated ex- 
actly for the other two kinds of possible collisions. From previous discussions 
about the nature of the singularities and also from the study of the regular- 
ization of the two-body problem, we obtain the elements with which we can 
schematize the question as follows. With t tending to the instant t\: 

a) lim r = 0; 

t — 

b) position and (barycentric) velocity of the third body P3 , i.e. ^3,^3 and 
Psx^Psy^ tend to finite and well-defined limits when t tends to ti; in 
particular 

lim r23 = lim rsi > 0; 

t — yti t — ytx 

c) the fraction 1/r becomes infinite for t — > ti, because of (a), but remains 
integrable, and, by putting ds — dt/r^ one defines (up to an irrelevant 
additive constant) a parameter s always increasing with t and tending to 
a finite value when t tends to ti; 

d) from the energy integral, by multiplying by r and passing to the limit, 
one obtains 
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r 

lim rT — lira - 

t — >^ti t — yti 2 






(the velocity of P 2 becomes infinite because of (4.21)). 

That having been stated, we pass from system (4.20a) to another system more 
suitable for subsequent considerations. Since system (4.20a) does not change 
if a constant is added to T-L, we can substitute the quantity T-L — h for H. By 
doing this, one confines oneself to that kind of solution which corresponds to 
a well-defined, however entirely arbitrary, value of h; the advantage then is 
that T~L — h vanishes along each one of the solutions under consideration. If 
now we put 



=r {n-h)=rT3 + r {Ti ^ T2) - rU - r h (4.22) 



and replace the independent variable t by 5, as defined in (c), system (4.20a) 
becomes 



dxi 


* 

1 


dpix 


* 


ds 


dpix' 


ds 


dxi 


dyi 


_ dH* 


d Piy 


dn* 


ds 


dpiy' 


ds 


dyi 



(i-2,3). 



(4.20b) 



with the first integral = const. 

We are concerned, however, only with the particular kind of solution where 
7^* = 0, which corresponds to that of system (4.20a), where H = h. Let us 
now see what improvement has been made by the introduction of the func- 
tion H*. If we look at the analytical structure of Ti* for t ^ ti, we see that 
something has been obtained: the infinities actually disappeared, both in U 
and in 7i + ?2 (since they have been multiplied by r); however, we are not 
yet dealing with a function that is regular with respect to all the variables. 
In fact, since W* contains r, the function presents a critical point with re- 
spect to X 2 and 1/2 , when these vanish; furthermore, P 2 x P 2 y^ because of 
(d), grows indefinitely, and then, also with regard to the variables p 2 x and 
P 2 y^ the collision of Pi and P 2 is not included in the domain of regularity. 
We must therefore resort to a further transformation which involves the vari- 
ables X2, V 2 and p 2 x^ V 2 y The required transformation is the one we have 
already dealt widely with in Sect. 2.6 (the Levi-Civita transformation). We 
shall therefore write 



Z2^X2 + iy2 = {^2+imf (4-23) 

This is merely a point transformation; we shall correspondingly choose conju- 
gate momenta il2^, n 2 n so as to ensure that the transformation is canonical. 
We shall impose the condition that 

P2x dX2 + P2y dy2 = Il2^ d^2 + drj2 (4.24) 

becomes an identity by substituting (4.23) into it. Explicitly, 
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P2x + ^ P2y — 



772 ^ + i Il2r] 

2 (6 -*%)' 



(4.25) 



Equations (4.24) can actually be obtained by differentiating (4.23), changing 
i to —i in (4.25) and then multiplying the resulting equations and taking the 
real part of the product. As r = \z 2 \^ r — IC2P and, from (4.25), by equating 
the squared moduli, 

2 , 2 “^2^ 4- 772 ^ 

P2x^P2y= ^ 

that we can also write 

= ( 4 . 26 ) 



From (4.26) and from (d), we finally get 



r 

2 



{pIx + ply) 



+ ^2r) 
8 



^ m\ml 
mi + m2 ’ 



which guarantees that 77|^ + 77|^ remains finite when the two bodies P\ 
and P 2 tend to collide. This, together with the fact that one can see that 
the direction Pi P 2 tends to a well-defined limit, ensures that the same 
occurs for 772^ and Il 2 r^ separately. Multiplying both sides of (4.25) by 
^ = (6 + ^^2) (6 - ^^2), one obtains 



r {P 2 x +iP 2 y) = 2 (^ 2 ^ + i n 2 n) (6 +iP 2 )- 



(4.27) 



From (4.23), (4.26) and (4.27), the result is therefore that 
2(2, 2/2, r, r {pI^ +ply) , rp 2 x, rp 2 y 



are all second-degree functions, in the new variables ^ 2 , P 2 -> ^ 2 ^ and 772ry, and 
are regular in the neighbourhood of ^2 = 0, 7^2 = 0 and of the finite values 
of 77^2 and Il 2 y corresponding to a collision. As far as l/r23 and l/rai 
are concerned, which already behaved regularly in the neighbourhood of a 
collision as functions of X2, V 2 and X3, ^3, it is clear that such a condition 
is also maintained by the substitution of ^2, V 2 for X2, ^2- 

In conclusion, therefore, the complete regularization of 77* has been 
achieved, and, together with it, also the regularization of the system of the 
equations of motion, which is still a canonical system, since a canonical trans- 
formation has been performed. The new system with the Hamiltonian® 77* 
expressed in the new variables will be 



d^2 


m* 


dll2i _ 


dH* 


dX3 


_ dH* 


CO 

s 


1 

* 


ds 


~ dH2i ' 


ds 


d^2 ’ 


ds 


dpsx' 


ds 


dxs 


dri2 


m* 


dll2y _ 


dH* 


dys 


dH* 


dpsy 


dH* 


ds 


~ on2y 


ds 


dr]2 ’ 


ds 


1 

CO 


ds 


dys 



6 



Remember that for us 77* = 0. See also the discussion in Sects. 1.14 and 2.6. 
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If we pass from the planar problem to the spatial one, clearly the num- 
ber of degrees of freedom increases by two, since now three coordinates are 
needed to characterize each of the two points P 2 and P 3 . We shall write 
now X 2 , V 2 i Z 2 and X 3 , ys, zs for the coordinates of the two points and 
P 2 x 5 P 2 y^ P 2 z and psx, Psy^ Psz for the conjugate momenta, still given by the 
components of the momenta with respect to the barycentre. Moreover, 



Ti = 



1 



{P2x + {P2y PP3y)‘^ + {p2z + PSz)‘^ 



2 mi 

72 = 7T^ [pL+P2j/+pL], 



2 m 2 

T — Ti + T2 + Ts, 



n 



Ta 

T-U. 



^JpL + pI + pI], 



Since the transformation given by (4.23), owing to its intrinsic nature, is 
not extensible to three dimensions, we ought now to find a canonical transfor- 
mation, and not just a point transformation, and hence a transformation more 
general than (4.23), which enables us to regularize the new Hamiltonian sys- 
tem built up starting from (4.29). As we know, after unsuccessful attempts, 
Levi-Civita^ attained the desired transformation about ten years after the 
publication of the results obtained by Sundman. Levi-Civita’s method con- 
sists in the introduction of a fictitious intermediate motion of parabolic type. 

Let us now see what this means. If we consider the body P 2 at an instant 
t, with momentum P 2 with respect to the barycentre, we can also imagine 
the fictitious motion of a point mass which occupies at the instant t the same 
position as P 2 but for which the momentum P 2 is the momentum with 
respect to the point Pi, where we have located the origin of the coordinate 
axes. We can furthermore consider it subjected only to the Newtonian attrac- 
tion exerted by Pi, that is, subjected to a potential given by Gmi m 2 /r. We 
then have the motion of a body subjected only to the Newtonian attraction 
of a fixed centre. If h is the total energy of the three-body system, we can 
consider the fictitious motion having the same total energy, for example 

1 \P 2 ? _ Gmi m2 

2 m 2 r 



Even more generally, one can consider the motion of a body of unit mass 
subjected to a potential whose constant is defined by means of the equation, 
analogous to (4.30), 

= (4-31) 

If < 0, we have an intermediate elliptic motion with the same energy as 
the true motion of the system. If = 0, we have instead a parabolic motion 
as the fictitious motion, which, as we shall see, regularizes the true motion. 



^ T. Levi-Civita: Sur la regularisation du probleme des trois corps, Acta Math. 42, 
99-144 (1918). 
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Let us therefore consider the fictitious motion described by the Hamilto- 
nian 

= \ iplx+ply +pIz) “ “ = 0. (4.32) 

By defining dt = r ds as before, the solutions for the parabolic motion will 
satisfy the system 



dp2x 


d {rH) 


dX2 


_ 9{rH) 


ds 


9X2 


ds 


1 

to 


dp2y 


9 (rH) 


dV2 


9{r'H) 


ds 


9y2 ’ 


ds 


9p2y 


dp2z 


9 {rH) 


dZ2 


9{r'H) 


ds 


9z2 


ds 


1 

to 



(4.33) 



As we know (Sect. 1.15), all the solutions of system (4.33), taking into account 
(4.32), are represented by a complete integral of 



1 

2 ^ 



{pIx + ply + pIz) = F{^2, m, C 2 ) = const., 



where ^ 2 , C 2 are three constants which must satisfy F {^ 2 ^ ^ 2 , C 2 ) = 
since 7^ = 0. If we introduce polar coordinates r, d, ip in the usual way, the 
corresponding Hamilton- Jacobi equation will be 



r f (dw^y 1 (dw,y 1 fdWiy 

2 \ \ y \ dd J sin^ d \ dip J 



const. = k. (4.34) 



One can see that (4.34) has a complete integral independent of ip given by 



Wi = Wi (x 2 ,y 2 ,Z 2 ; 6 ,%,C 2 ) = \/2gr\/l - cos-d = \pis/Qr - g r, 

(4.35) 

where 



6= cos^? 



6 X2 + ri2y2 + C 2 Z2 Q -r 



gr gr 

By substituting this in the Hamilton- Jacobi equation (4.34), one obtains g/2 
on the left-hand side and so 

g^2k. (4.36) 

The equations which define the motion will then be (see Sect. 1.15) 

Q 

V2 _'n2\ dWi ^ _ Q (V2 V2 

r g )’ dy 2 ITi V 6 

^ C 2 ' 

wAr gj’ dz2 ~ wAr 



_ rr _ ^ ^2 

d ^2 ITi V r g 

.^-n -JL 

drj2 Wi 

dWi 



dWi g 



r 



9(2 



= n- 



2 C 



(4.37a) 
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P2 = 



that is . 

TT =JLr--^ 

U qJ' "" 

and then 

rp2 = QII 2 

On the other hand, since (4.34) is of type (1.C.97) 

dF _ 1 772 

dm 2 p ’ 






96 2 q ' 

and, in vector form. 



n 



2 — 



1 9. 

2 p- 

The integration of (4.39) immediately gives 

iT2 = -J-id 

2 Q 



a 



iJ. 



2C 



OF 

'dC2 



(4.37b) 

(4.38) 



1 ^ 

2 p 

(4.39) 



(4.40) 



where g and a are constant vectors. If we vectorially multiply (4.40) by g, 
we get 

g X TI2 — g X a = C2, (4-41) 

where the vector integration constant has been called C2 in view of its mean- 
ing, which will now be clarified. In fact, by vectorially multiplying the first 
of equations (4.37b) by g and the second by r we also get 

I II 

Q X n2 = C2 = — {q X r) , r XP2 = — {-r X q) = — (gxr) , 

that IS, r X P 2 = g X II2 = C2, which expresses the constancy of the angular 
momentum r x P2, simply given by the vector C 2 . 

The equation of the plane of motion will be obviously provided by C 2 • 
r = 0. Equation (4.41) also ensures that the vectors g and II 2 belong 
to the plane of motion. From the expression (4.35) for we have ITf = 
2 {gr — g ' r) , and from the first of (4.37b), 



\n 



2 1 = 



2 (r -2 - r ^ . r) 






r 

g' 



Also, because of (4.37b), since g • II 2 = {g • r — gr) /Wi = — W\j2, we 
finally obtain 

^ I JT2 P - (e • U 2 f = e^\n 2 f = rg-^{gr- Q-r) 

= i(^r + ^-r) = i^r(I + cost^) . 

But because of (4.41), \ II 2 -{q- -^ 2 )^ = |c 2 p- The last relation can be 

written as |c 2 p = \ gr {\ + cos??), that is, defining c = |c 2 |, 

2c^/g 
1 -I- cos ?? ’ 



r = 



(4.42a) 
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which is evidently the polar equation of the trajectory, and which, because 



of (4.36), will become 



(? jk 

1 -h cos 'd 



(4.42b) 



The constant vector whose modulus is 2A:, has the direction of the axis of 
the parabola and points from the focus towards the vertex (epicentre); the 
parameter of the parabola is given by 2 c^/^. 

Let us now go back to our canonical transformation generated by the 
function 



Wi = V 2 - 



+ 2/2 + 4 ) HI + ^2 + Cl) ~X 2 ^ 2 ~ 2/2 m 



which satisfies 



P2xdx2 +P2ydy2 + P2z d Z2 ~ “ n2r)dr]2 ~ c? C2 = dWi, 

as one can see from (4.37a). By inverting the first three of equation (4.37a) 
and by replacing r/g by | /T 2 P, we obtain 

X2 = I n2 P ^2 + 772 ^, V2 = 1 n^2 P r]2 + W\ Il2n, Z2 — I II2 P C2 + Wi 7 l 2 (^, 

(4.43) 

which, together with the last three of (4.37a), are expressions for the “old” 
variables as functions of the “new” ones. In the new variables, we can then 
write a Hamiltonian system, which will be the three-dimensional equivalent of 
system (4.28). In this system, 77* = r (77 — /i), where 77 is given by (4.29) 
but expressed in the new variables, will be the Hamiltonian. This system 
appears to be completely regularized. In fact, when P2 gets close to Pi, 
the modulus of p 2 tends to infinity, but the product r \p 2 P remains finite 
and different from zero. Similarly ^ 2 ^ P 2 ^ C 2 , tend to finite and well deter- 
mined limits, because q has been introduced as a vector whose length is 2k; 
772^5 dl2r], T 72 (^, tend instead to zero, since the modulus | 772 | == (^/t>) IP2 I 
tends to zero. If, instead of the form (of Poincare) chosen for the Hamilto- 
nian system, one considers the form expressed in the Jacobi coordinates as 
defined in the last section, Levi-Civita’s method is applicable as well.^ Of the 
three possible collisions (Pi, P2) , (P2, P3) , (P3, Pi) , we have dealt with 
(Pi, P2). It is clear, as we have already said, that, on repeating our reasoning 
(changing the canonical transformations) for (P2, P3) , (P3, Pi), one reaches 
an analogous result. Obviously, to do that, we have also to perform in ad- 
vance another change of the independent variable. The last change could be 
avoided by choosing, once and for all, a parameter which is symmetrical with 
respect to the three kinds of collision. For the parameter 5 , we had rds = dt; 
if we now take into account that 

^ T. Levi-Civita: loc. cit., pp. 139-140. 
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rU = Gmi 1712 + 



G minis , ^rri2 ms 

r + G r, 

^13 r2s 



considered as a function of the transformed variables (^ 2 , '^ 2 , C 27 ^ 2^5 ^ 277 , ^ 2 c) 
and also of X 3 , 2 / 3 , zs behaves regularly in a neighbourhood of a collision 
(Pi, P 2 ), collision point included, we can even decide to replace s by the 
parameter r, defined by: 



dr = rU ds = U dt, (4.44) 

The parameter r can play the same role as s for collision (Pi, P 2 ) , and 
furthermore, given the symmetrical structure of P, it can play a similar role 
for collisions (P 2 , P 3 ) and (Pi, P 3 ). Therefore we shall write a new canonical 
system, by replacing P* by 

r U 

It must be always kept in mind that the above considerations apply exclu- 
sively to the solutions corresponding to P* =0 and hence /C = 0. In that 
case, the Hamiltonian system will be 
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In (4.45), /C is a regular function of the variables X2, P2, ^3, Ps while the 
positions of the three bodies remain distinct. In the neighbourhood of a col- 
lision, (Pi, P2) , (P2, P3) , (P3, Pi) , a suitable canonical transformation, of 
type (4.37), restores the regularity. System (4.45) is thus to be considered, 
however unregularized, to be always amenable to being regularized without 
difficulty. It should then also be regarded as “solved”, in the sense that a 
solution exists, given by a convergent series expansion for any value of r, 
and then of t. 



4.3 The Restricted Three-Body Problem 

As we have already stressed by writing the equations of motion for the three- 
body system in Jacobi form, by working out all the possible reductions one 
can reach the minimum order six for the system of the equations of motion. 
If one confine oneself to taking into consideration only planar motions, then 
the order of the system goes down to four: however, it is still a problem of the 
utmost complexity. It may be made simpler, albeit not completely solvable, 
if one introduces the approximation of considering one of the three bodies 
uninfiuential on the motion of the other two (called primaries): clearly this 
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is equivalent to considering m3 negligible in comparison with mi and m2 
and the point P3 moving “sufficiently far” from P\ and P2* We shall lastly 
consider the case where the primaries move along circular orbits with con- 
stant angular velocity uj around their barycentre {planar circular restricted 
problem).^ To become acquainted with the simplification which occurs in this 
case, let us again consider the equations in Jacobi form (4.8) and (4.9). 

By putting m3 = 0, the first becomes 



r = 




(4.46) 



and the second, if for the sake of simplicity we write (1712/1^) r = and 
g — (mi//i) r = g2, becomes 



Gmi Gni2 

Qi 92 



(4.47) 



Then, of the two equations, the first describes the relative motion of the 
primaries and can be considered solved already, since we know the general 
solution of the two-body problem; the second equation, since r is known 
from the first one, merely describes the motion of the third body (m3 ~ 0) 
with respect to the barycentre of the first two. The problem is therefore 
brought down to the solution of this second equation. Note that, since the 
approximation m3 ~ 0 has been made, we can no longer resort to the usual 
first integrals to lower the order. We may, however, resort to other kinds 
of simplification. First of all, since both the primaries rotate with constant 
angular velocity around their barycentre, we can take as the reference frame 
in which to study the motion of m3 the plane (rotating at constant angular 
velocity uj with respect to the inertial barycentric plane) where the primaries 
are at rest and the x axis coincides with the straight line containing mi and 
m2 {synodic system). If, to spare symbols, we call x\ and X 2 the abscissae 
of Pi and P 2 and x and y the coordinates of P3, (4.47) will have as 
components in the rotating system 



X — 2ujy — lj x = G — ^ {xi — x) P G {X 2 — x ) . 

9i 92 



y 2 UJ X — y 



-G 



mi 

9i 



92 



(4.48a) 



A further formal simplification of these equations can be obtained by a suit- 
able choice of measurement units: 



- choose the unit of mass such that mi + m2 = 1; 

- choose the unit of length such that r = 1 ; 

- choose the unit of time such that G = 1. 

^ The term “restricted” was introduced by Poincare (1892), but the schematization 
of the problem goes back to Jacobi (1836) and Euler (1772). 
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Let us call the smallest of the two masses (for instance, m 2 ) ~p (which is 
dimensionless) and locate it to the right of the origin. Obviously, the other 
mass (mi) will be given by 1 — /x. Since, in the old measurement units, 
mi x\ + rri 2 X 2 = 0, and since now X 2 — xi = r = 1, it will also be the case 
that Xi — — X 2 = 1 — /X. In these units, we also have a; = 1; in fact, for 
the two primaries. 



Gmi m2 



m 2 |x2 1 = mi \xi I 



{uniform circular motion); and then also 

G (mi + m 2 ) /I I , I i\ 2 2 

= (|xi| + \X2\) = ruj% 

from which G (mi + m 2 ) = r^ {Kepler’s third law) and therefore, in the 
chosen dimensionless system, to = 1. 

If we now call the potential in these units U, then 

_ 1 M 



where 

Qi = + 

and system (4.48a) will become 

. dU 
X - 2y - X = — 



{x + fi-1) +y^ 



. 9U . dU 

x-2y-x=—, y + 2x-y=—. 

ox oy 

If, lastly, we define a new potential function modified as follows: 

^ + y‘^) + u + - ji) , 



(4.48b) 



the equations become 



•• o 

x-2y^—, 

ox 



y + 2x 



Although in this case one cannot have, as we have already warned, the first 
integrals corresponding to the usual conservation laws, one can demonstrate 
immediately that (4.50) admit the first integral known as Jacobi integral}^ 
By multiplying the first equation of (4.50) by x and the second by y and 
adding we obtain x x + y y = d^/dt^ from which 






and then 



2^ — XT — ir — G = const. 
We shall call the constant G the Jacobi constant. 



^See Sect. 1.9. 
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The Jacobi integral is therefore a relation which connects, in the dimen- 
sionless synodic system, the square of the velocity with the coordinates of 
the point P 3 (whose mass has been taken equal to zero); in this system the 
equations of motion contain only one parameter: the dimensionless mass JI. 
It appears that one can reduce the fourth-order system above, and in various 
ways, to a second order one by means of the Jacobi integral and the elimina- 
tion of the time. Unfortunately this has only formal interest, because then it 
is not possible to solve^^ the second-order system obtained. 



Tisserand’s Criterion 

Let us recall now, as an application of the Jacobi integral, an (approximate) 
criterion of identification of comets due to Tisserand.^^ The problem is as 
follows: if it happens that a comet passes very close to a planet (for instance, 
Jupiter), then the elements of its orbit will undergo a significant change, 
and later on it may not be easy to recognize the comet itself, unless one 
resorts to properties of the elements which remain unchanged even after the 
perturbation of the planet. Tisserand suggested regarding the system Sun- 
planet-comet as a restricted three-body system, where the comet represented 
the third body of negligible mass. If we take into consideration the system 
Sun-Jupiter-comet, in fact, we are allowed to consider the origin of the axes 
of the inertial system located in the Sun and consequently the orbit of the 
comet in the inertial system will be its heliocentric orbit. Furthermore, since 
the eccentricity of Jupiter is very small (0.048435), we consider its orbit to be 
circular. Finally we take the inclination of the comet to be negligible. At this 
point one can apply to our system the Jacobi integral, expressed, however, 
in the inertial system. If we call the coordinates of P 3 in this system x and 
y, we have 

X = X cost -\-y sint, y = y cost — x sint, (4.52) 

and then we can write the Jacobi integral (4.51) in the inertial (heliocentric) 
system: 

—“2 ^2 ^ 2 ( 1 — 71 ) 271 /, 

X -\-y =2(xy-yx)-\ -\ C. (4.53) 

Qi Q2 

Putting 77 = 0 (Jupiter’s mass negligible), the system Sun-comet will have 
the energy integral 




g being the Sun-comet distance and a the semimajor axis of the orbit of 
the comet. 

^^See, for instance, V. Szebehely: Theory of Orbits. The Restricted Problem of Three 
Bodies (Academic Press, 1967), Sects. 2.2 and 2.3. 

^^See F. Tisserand: op. cit., Vol. 4, p. 203. 
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With /i = 0, one will also have the integral of “areas” and then 

c = xy-yx= y/a (1 - e^), (4.55) 



where e is the eccentricity of the orbit of the comet. By substituting (4.54) 
and (4.55) into (4.53), one has 



2 1 

Q a 



2 yja (1 - e^) + 



2(1-^) ^ 2m 

Q2 



a 



By making the approximations p = /i(^2 ^ ^) = 0, one finally 

obtains 

i + 2 y/a (1 - e2) = C. (4.56) 

Equation (4.56) is the fundamental relation of Tisserand’s criterion. As C is 
constant, at the times to and ti 

— + 2 (1 - el) = h 2 J ai (1 - ef), 

ao ^ ai V 

with the symbols having obvious meanings. In the case where one must take 
account of the inclination of the orbit of the comet. 



h 2 A/ao (1 — Cq) cosio = \-2\ a\ (1 — e\) cosii. 

do ^ d\ ^ 

As we have said, this is an approximate criterion; notwithstanding, if we 
insert the elements of “two” comets observed at different times into (4.56), 
and notice a sizeable difference, we can be sure that they are not the same 
comet. 



The Equilibrium Solutions 

Let us now go back to equations (4.50), with first integral (4.51). Since 
we have seen that the general problem of three bodies admits planar so- 
lutions where the three bodies rotate with constant angular velocity around 
their barycentre, we may expect correspondingly in the restricted problem 
0) the same configurations as equilibrium solutions (with respect to 
the synodic system). Equation (4.51), once the Jacobi constant has been fixed 
by means of the initial conditions, will give us the velocity of the point P3 as 
a function of the coordinates; or, alternatively, for every value of the velocity 
they provide the corresponding position where the point may be located. In 
particular, at the zero value of the velocity there will be a corresponding 
family of curves (zero velocity curves or Hill curves) given by 



^{x,y) = j- 



(4.57) 
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If we rewrite (4.49) by substituting x^-\-y^ — (1 — /i) (1 ~ m) ^ 

we obtain 



^ (ei, ^ 2 ) - (1 - (I el + ^) 

= ^ + (i-p) Q + ^) + (^2-if . 



(4.58) 



One can easily see that ^ is always positive, and, from the last expression 
above, that it has a minimum ^min ==3/2 for qi = Q 2 = 1. That is, ^ has a 
minimum when P3 is at a vertex of an equilateral triangle with P\ and P 2 
at the other two vertices. There are then two possible positions (see Fig. 4.5). 

This case corresponds to (7 = 3. Therefore, for (7 = 3 the Hill curve de- 
generates into the two points, which we call L4 and L5 and which represent 
the equilateral solution of the restricted problem. From system (4.50), it can 
be seen that, if one puts x = 0, ^ = 0 (velocity equal to zero), nonetheless 
X = d^jdx^ y = d^jdy] therefore, if one also wants the acceleration to be 
zero (equilibrium solutions), one must have d^jdx = d^jdy = 0. These 
are also the conditions which have to be satisfied by the coordinates of the 
stationary points of ^ {x^y); the minimum points already found then belong 
to this category. Let us now try to look for such points and then for the 
equilibrium solutions, by following the standard procedure. Hence, let us set 

dgi 8 q2 _ 8qi 8^ 8q2 

8 x 8 q\ 8 x 8 q 2 8 x ’ 8 y 8 q\ 8 y 8 q 2 8 y 

By using the first of equations (4.58), we have 



(1 - /i) ( - -2 

Qi 



h /i ( ^2 2 ) 

V qV Q2 



= 0 , 



{i- fi) ( £>i - 4) ^ ^ P _ J_) = 0. 

QiJ Qi V elJ 02 



(4.59) 
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Consider first the case y ^ 0; the second equation will give us 
(l-/i) 2) ^^(02 2) — 

V QiJ Qi \ Q 2 J Q2 

which, when inserted in the first one, gives ^2 = 1 ^hen also q\ = 1. 
The only solution of these equations (and of the system) is thus = ^2 = 1 * 
Then, for the case ^ 0 , there are only two possible solutions for the position 

of P3 : the two vertices of the equilateral triangles whose basis is the segment 
Pi P2 . By solving 

\/{x + 7lf +y‘^ + +y^ = 1 , 

we obtain 

1 _ , V 3 

X = - — a , y = ± — 

2 2 

for the coordinates of L4 and L5 respectively. 

Consider now the case ^ = 0 . We are left with only the first equation 



(1-m) 



2 

Qi 



xPli _ 

^ M 

^1 



Q2 



1 \ X jJ. — 1 



Q2 



Q2 



= 0 , 



with Qi = \ x ii \ ^ Q2 = \ x ji — 1\. There are three possible cases: 



a) P3 to the left of Pi: 

b) P3 between Pi and P2: 

c) P3 to the right of P2: 



X < —JI, 

— p < X < 1 — p, 
X > 1 — p. 



For these. 



a) Qi = —X — p, 

b) Qi=x + y, 

c) Qi=X + Ji, 



Q2 = l-X-Ji=\ + 

Q2 = l-X-Ji=l- Qi, 
Q2 = X + J1 - I = Qi - 1. 



Let us substitute these expressions in the equation to be solved. We see that, 
if we put 



a) £»i = ^, ^2 = 1 + £>5 one has 



(1 -m) 



^ _9 






1 - 






= 0 ; 



b) = ^, ^2 = 1 — one has 



(1 - y) 

0) ^2 = ^5 ^1 = 1 + one has 



1 - ^ 



1 



= 0 ; 





Q /1 \2 


+ (1 - /i) 


1 ■ 








L ^ J 
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In all three cases the corresponding equation has only one (positive) solution 
for Q. Let us look at cases (a) and (c) represented by the same equation, 
which can be written in the form 



F{g)^ 



Q- 



1 + ^- 






\--fl 



< 0 . 



One can verify that 



F'(^)>0, F(0+) = -oc, F(1) = 0. 



Then F = — /l/(l — ^) < 0 for only one value of q between 0 and 1 . Let us 
call the two points corresponding to (a) and (c) Li and L2 • 

In case (b) the equation becomes 







1 

F 



> 1 



[n < 1/2 by hypothesis); the function G {g) is increasing in the interval 
1/2 < Q < 1: moreover, 



G(l/2) = 1, G(l_) = oo. 

Then G takes the value (1 —Jl)/F once in the interval 1/2 < ^ < 1. 
We call the corresponding position L3 (which is then closer to mi than to 
m2). The points Li, L2, I/3, L4, L5 which we have determined can be given 
an interpretation that is particularly meaningful from the geometrical point 
of view. If, in fact, we consider the four-dimensional manifold which consti- 
tutes the phase space of our system with two degrees of freedom, we see that 
the Jacobi integral reduces the dimensions of the manifold available for the 
motion of the system to three. Any one 4 -tuple of numbers x, x, y which 
satisfies the Jacobi equation — 2 ^{x,y) — G represents a possible 

motion for a given G. The Jacobi equation is of the type F(x, y, x, y, G) — 0 . 
It is then a three-dimensional manifold whose singular points will be given 
by dF I dx = dFjdy = dFjdx — dFjdy = 0 (we exclude from these consid- 
erations possible singularities of ^ (x, y) which we shall deal with when we 
study the regularization of our problem). In our case, the written equations 
become dFjdx — dF/dy — x = y = 0 . By substituting these in the equa- 
tions of motion, one additionally obtains x = y — 0 . Then the singular points 
of the three-dimensional manifold of the states of motion are the stationary 
points, or equilibrium points, which we have already determined. 
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1-M 
^ L5 






Fig. 4.6 



The Hill Curves 

We have already defined the zero velocity curves, or Hill curves, as being 
the curves belonging to the family ^{x,y) = C/2, with C > 3. Taking 
into account that ^ is somehow a potential function, the Hill curves are the 
equipotential lines. We have already seen that for C = 3 the corresponding 
curve degenerates in the points L4 and L5 . Moreover, the symmetry prop- 
erty ^ (x, y) = ^ (x, —y) is clear, since in qi and Q2 y is squared; then the 
curves ^ = const, are symmetrical with respect to the x axis. Let us now 
study the behaviour of the curves as the Jacobi constant varies (Fig. 4.6). In- 
creasing C a little above the value 3, one has two closed curves which enclose 
the points I/4 and L5 . The part of the plane inside these curves corresponds 
to = 2^ — C < 0, the part outside to >0. Therefore these curves 
separate the regions in which the motion is possible (real v) from those in 
which the motion is not possible (imaginary x). If C increases further on, 
the two curves will join at Li. After a further increase which will cause the 

^^G.W. Hill: Researches in the lunar theory, Am. J. Math. 1, 5-26, 129-147, 245- 
261, (1878). 
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forming of a single closed curve containing L 4 , L 5 , Li, there will also be a 
joining at L 2 and finally at L 3 , encircling the primaries. A further increase 
of C makes the joining at L 3 disappear, leaving the two primaries encir- 
cled. Figure 4.7^^ (which must be completed symmetrically) together with 
Fig. 4.6 gives an (approximate) idea of what we have said; the last phase is 
not represented in the figure. The curves refer to Jl/{1 —Ji) = 0.1. 

One can apply what has been explained above to the motion of the Moon. 
If we neglect the eccentricity of the orbit of the Earth and the inclination of 
the Moon, one can schematize the Earth-Sun-Moon system as the circular 
restricted problem we have so far dealt with. Therefore the question is that 
of calculating the Jacobi constant for the Moon: C — 2^ — ^ v being 

the velocity of the Moon with respect to the rotating coordinate system. 
One obtains a value Cuoon > C 3 for the Moon, C 3 being the value of 
the Jacobi constant for the curve passing through L3. Therefore the Moon 
remains inside the zero velocity oval, which in turn is inside the eight-shaped 
limiting figure: hence it cannot leave the Earth. For a thorough analysis of 
the Hill curves we refer the reader to the already mentioned textbook by 
Szebehely. 



C = 3.65 




Fig. 4.7 



4.4 The Stability of the Equilibrium Solutions 

System (4.50) with Jacobi integral (4.51) admits, as we have seen, the equi- 
librium solutions made up of the five Lagrangian points grouped into the 
collinear solution and into the equilateral solution. In view of their applica- 
tions, studying the stability of such solutions is of the utmost importance. 
From a physical point of view, to say that one of the Lagrangian points is 
a position of stable equilibrium simply means that a particle which occupies 

^^Taken from G.H. Darwin: Periodic Orbits, Acta math. 21, 99-242 (1897) 
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that position tends to go back to it whenever it is taken away from it by a 
perturbation. Given the approximations made to schematize the restricted 
problem, with the consequent impossibility of resorting to the usual con- 
servation laws, the classical Lagrange-Dirichlet theorem, which states that a 
position of stable equilibrium is an isolated minimum for the potential energy, 
does not help at all. We shall study the stability of our solutions by applying 
the definition given in Sect. 1.2 (L-stability). As we shall see, it is not a quick 
matter; in fact, in the case of the equilateral solution it was only in recent 
years (the 1960s), two centuries after the formulation of the problem, that 
an exhaustive and definitive answer has been obtained. 

It is convenient, especially for the study of the last case, to make use of 
the Hamiltonian formalism. If we define 



qi=x, p^=x-y, 
q2 = y, P2 = y-\-x, 
and the Hamiltonian function 

^ = ^ {pi+pI) - {(iiP2 - q2Pi) + ^ {ql + ql) -^(91,92), 

then system (4.50) is equivalent to the Hamiltonian system: 



m 

^ ^P\+ 92, 
OPl 

r)T-l 

^ ^ 



dn 

Pi = =P2-qi + 

dqi dqi 

dn d$ 

P2 = = -Pi - 92 + ^. 

092 OQ2 



(4.60) 



(4.61) 



(4.62) 



The equilibrium solutions correspond to the critical points of the function 
0.2) — ^ y) • If we generically call the coordinates of these points 
X = a, y = then qi = a, ^2 = b, pi = —6, P2 = cl will correspond to them. 
If we perform the translation — a = ^1, q 2 ~b = ^2 and, as a consequence, 
Pi -h 6 = 771, p 2 — a — t] 2 , we find that this is a canonical transformation, and 
that we can then write a new canonical system in the variables ^1, ^2, Vu P 2 ^ 
for which the equilibrium solutions will correspond to = ^2 = '^1 = ^2 = 0- 
This system is 



• dn . 



with 



Vi 



m 

d^i 

dn 



d^ 



, d$ 



(4.63) 



= \ (^1 +’12) - (692-691) + \ (Cl +Cl) -^(6,6). 



(4.64) 



To spare symbols, we have maintained both H and ^ for the functions in the 
new variables. 
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To study the stability of the solution = ^2 == = '^2 = 0 , we must lin- 

earize system (4.63); by expanding it in a Taylor series in the neighbourhood 
of the origin, we get 



= <^2 + ^ 1 , ^2 == -^1 T 772 , 

(l|) L + (44) L + * + O*- (4^65) 

* “ + (44) l/‘ + (4) 4" ■ ^ 



where O 2 and O 2 will not contain terms of the first degree in ^ and 77 . For 
the Hamiltonian, one has the expansion 



vl) - (Cl »?2 - C2 m) + l (Cl + C2) 



C? + 2 



52 ^ 

5Ci d^2 



+ T~Ls d" ^4 H" 



where T-Li represents a homogeneous polynomial of degree I in the canonical 
variables, and the constant term has been omitted, because it will be unin- 
fiuential in the equations which we shall use. The linear part of system (4.65) 
is of type (1.A.19), with a matrix of coefficients 



/ 0 1 10 
-1 0 0 1 

and the corresponding characteristic equation 

+ (4 - - ^^ 2 ) = 0- (4-68) 

We shall now calculate the three second derivatives of ^(^ 1 , ^ 2 ) at the five 
Lagrangian points. Because 

lo ^ i^xx) I x=a , 
y=b 

(^6^2) lo ^ i^xy) I x=a , 
y=b 

(^^2^2) lo ~ i^yy) I ^—9 ’ 

y=b 




we shall start from the expressions of and in (4.59) to calculate the 
second derivatives. Then 
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(1 - m) \qi - 3 (a; -h /r)^] /i - 3 (a; -b /r - l)^j 
— 1 c E 






^2 



^ ^ 3 (x + jLf) (1 -m) , 3 (x -b /i - 1) m • 2/ 

xy 5 y < 



el 



el 



(4.69) 



^ (1 -/i) (e? -3y2) /x(^2_3y2^ 

^yy — i ^ cr r: . 






qI 



Let us first examine the collinear solution. Immediately we see that ^xy — 
0 and 

^ (1 -m) (£•! -3£>i) ^(£>2-3^1) 

^XX — K c: 

Q2 

2 (1 — 7Z) 211 

= 1 + ^ o ^ + -4 > 0. 

Q2 

The evaluation of the sign of ^yy requires a slightly more complicated cal- 
culation. By referring to case (b) of Sect. 4.3 (that is, to the point L 3 ), we 

have _ _ _ _ 

^ l-M l-/il H 

^yy ~ ^ ~ ^ n ^3 ' 

Q 2 Qi Qi Q 2 



Since = 0 , 



(a; -b /i) (1 - /i) + 

tX/ Q Q LX « 

qI 

_ 1 — ji ji 

Ql — 2 ^ 2 ~ 

Qi qI 

_ Ji _1-Ji 
ei 2 2 * 

el el 

By substituting this in the expression for 0yy, we get 

. f - M -/1 1 1 

V' el) ei el~^[ei eiel el 



-el~ 62- ei _ £>2 - 1 M 1 \ ^ n 

= M — r-3 — = = V i ^ " ;;3 • 

Qi Q2 Qi \ Q2y 



eiel 



Q2 < L 



By means of analogous methods, one could verify that cases (a) and (c) give 
the same result. As regards the collinear solution we then have 

^xx ^ O5 ^yy ^ ^xy — 0- 

If we put A = A^, the characteristic equation (4.68) in our case will be 

-T 6 A “h c = 0 , 

with 



b-=A-^xx-^ 



yy^ 



c — ^xx ^yy ^xy ^ 
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Then A == ±y/A and the two real roots for A will have opposite signs: 

Ai = ^ (^—b + \/h^ — 4c^ >0, A 2 = ^ (^—b — \/b‘^ — 4c^ < 0, 

since c < 0, V b/\b\. Therefore \\^2 are real and As ^4 pure imaginary. Hence 
one of the four roots will have a positive real part; as a consequence, based 
upon what seen in Sect. 1 . 2 , the collinear solution for system (4.50) is unstable 
for all the three points Li, L 2 , L 3 . 

In the case of the equilateral solution, by substituting the values x — 
1/2 — /i, y = zb\/3/2 found in Sect. 4.3 into (4.69), we have 

^ 3 ^ ,3y/3,, ^ 9 

^xx — ^xy — 4: ^ (1 — 2 /x) , ^yy — 

By substituting these into (4.68) one has the characteristic equation 

A"^ + A^ H — — /i (1 — /i) = 0. (4.70) 

With the previous symbols. 



A 




-1 ± v/l-27^ (1-/Z) . 



One can immediately verify that the only possibility for solutions that are 
not unstable is for 

(1 - m) < (4.71) 

In this case, one obtains 

< Hi < 0, -^ > H 2 > -1, 

and then all the four solutions Ai, A 2 , A 3 , A 4 will be purely imaginary. We 
have seen, however, that, in the case of purely imaginary solutions, the linear 
system is not sufficient to determine the behaviour of the solution of the 
non-linear system: we are facing a critical case. As we already anticipated at 
the beginning of this section, the question has remained unsolved for about 
two centuries. After a first result due to Littlewood,^^ who demonstrated 
the stability of the equilateral solution for a very long (depending on 7 ^) but 
not infinite time, the stability for an infinite time has been demonstrated by 
applying a theorem, due to Arnold, concerning Hamiltonian systems with 
two degrees of freedom. Let us take a look at the contents of this theorem in 
our case. Go back at the Hamiltonian (4.66) whose form is 



T~L — 7^2 + 7^3 + 7/4 + 7^5 + • • • 



E. Littlewood: On the equilateral configuration in the restricted problem of 
three bodies, Proc. London Math. Soc. (3) 9 , 343-372, (1959); The Lagrange 
configuration in celestial mechanics, ibid. 9 , 525-543, (1959). 
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For I/ 4 , by substituting the values of the second derivative, one immediately 
gets 



^2 = ^ [vl + Tll) + 6??1 + (1 - 2/x) 66 , (4.72) 



which turns out to be an indefinite form. One can apply a linear canonical 
transformation^^ with constant coefficients to 7^2 such as to put it into the 
form 

7^2 = Qi P\ iuj2 Q2 P21 



with 

Ui — \/— yll, LO 2 — \/— ^2, 

which is a Birkhoff normal form; since in our case the roots of the character- 
istic equation are simple and purely imaginary, 



Pk — i Qk 2 ), 

and then, putting 

Qk ^k “b i (^k 7 Pk ~ f^k “b ^ 7 

we can write the second-degree term 7^2 as a sum of squares. Furthermore, 
still using a canonical transformation, one can find new variables, which we 
shall keep calling Qk^Pk^ such as to transform the subsequent terms to get 

T-L — —iuj\Ui + iiJ2U2 + — + 2/ii2t/it/2 + 1 ^ 22 ^ 2 ^ + 7^5 + • * * 7 

where Uk = QkPk {k = 1 , 2 ) and we know that uj\jijJ 2 7 ^ min {m,n = 
1,2, 3, 4) and that the coefficients fiij are real. Under the further condition 
that 

7^4 = - (M11 Ui + 2 /ii 2 U1U2 + /i22 u|) 

is not divisible by 7^2 = ~ Ui + 2 o ;2 U 2 , it can be demonstrated that the 
origin is a stable solution for the corresponding Hamiltonian system. This 
is the conclusion of Arnold’s theorem. The application to the case of the 
equilateral solution is due to A.Leontovich (1962), A. Deprit and A. Deprit- 
Bartolome (1967) and A. Markeev (1969). 



^^For a demonstration, see Siegel and Moser: op. cit., Sect. 15, or G. D. Birkhoff: 
Dynamical Systems, rev. edn. (Am. Math. Society, 1966), Chap. III. The actual 
determination of the transformation requires complicated calculations, see also 
K. R. Meyer, G. R. Hall: Introduction to the Hamiltonian dynamical systems and 
the N-body problem, (Springer, 1989), Chaps. II. G, VII. D. 

^^A demonstration of Arnold’s theorem can be found in Siegel and Moser: op. cit., 
Sect. 35. 
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Our final conclusions^^ are as follows 



The Lagrangian points of the equilateral solution are stable for all the 
ratios of masses which satisfy (4.71), with the exception of the two ratios 

15- 



30 

In the usual units 



= 0.0135160. 



45- 






90 



= 0.0242938 . . 






m2 



m2 < mi; 



mi + m2 

therefore, one can see that in the Moon-Earth case, where 



^Moon 
^Moon T ^0 



0.01241 . . . , 



we are a little below the smallest of the two forbidden values. 

Inequality (4.71), resolved by taking into account that p < 1/2, on the 
other hand yields 




(this number has been named the Routh value), from which one can deduce 
that the upper limit of the ratio m 2 /mi is about 1/25: any planet (Jupiter 
included) of the solar system may then constitute together with the Sun a 
pair of primaries for an equilateral solution. 

The discovery, in 1906, of an asteroid oscillating around the position which 
we have called L 5 in the Sun- Jupiter system was the first confirmation that 
the Lagrangian points were not simply a mathematical curiosity but, on the 
contrary, had applications even within the solar system. After this first as- 
teroid (which was named Achilles), more than twelve other asteroids were 
discovered oscillating around L 4 and L 5 in the Sun-Jupiter system. The 
asteroids oscillating around L 5 were named after Greek heroes in the Trojan 
war, while those oscillating around L 4 were named after Trojan heroes; al- 
together the entire group is called The Trojans. Those oscillating around L 4 
follow Jupiter in its motion, while those oscillating around L 5 precede it. 

Another application of the equilateral solution was suggested in 1961 by 
the Polish astronomer Kordelewski for the Earth-Moon system: he discovered 
clusters of meteorites in the proximity of the triangular libration points. They 
can be observed, under the best conditions, as a faint luminosity. Eventually 
the Voyager I mission showed another example of the application of the 
equilateral solution to the Saturn-satellites of Saturn system: a satellite was 
discovered (called Dione B) which oscillated at the same distance from Saturn 
as the satellite Dione (now renamed Dione A). As far as the collinear solution 
is concerned, some time ago,^^ it was suggested that it could explain the 

^^See K. R. Meyer, G. R. Hall: op. cit. in Footnote 16. 

Gylden: Sur un cas particulier du probleme des trois corps. Bull. Astron. 1, 
361-369 (1884); F. R. Moulton: A meteoric theory of the “Gegenschein” , Astron. 
J., 21 , 17-22 (1900). 
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Gegenschein phenomenon, a faint luminosity which is observed after sunset 
in the plane of the ecliptic in a direction opposite to that of the Sun. This 
luminosity is ascribed to the illumination by the Sun of meteoric particles 
that have accumulated around point L 2 . This is an interpretation which was 
subject to a number of criticisms, some of them well grounded, and therefore 
not universally accepted. 

Applications of the collinear solution can also occur in the domain of 
stellar dynamics. 



4.5 The Delaunay Elements 

for the Restricted Three-Body Problem 

In Sect. 2.5, we introduced the Delaunay elements for the two-body problem; 
there, we were dealing with a separable periodic system and the Delaunay 
elements L,G,H) were the action-angle variables of the problem. 

However, with the restricted three-body problem we are no longer dealing 
with an integrable system; we can therefore regard the restricted problem 
as a problem generated by adding a second primary (of mass ~j2 in the di- 
mensionless system) to the two-body problem, which consists of a primary 
of unit mass and a third body of negligible mass. In this way, the second 
primary turns out to be a “perturbation” of an integrable system. The ele- 
ments, which before were constant, now vary with time and the differential 
equations which rule these variations can be written immediately, since they 
are just the canonical equations; for instance, the equations for the pair of 
conjugate variables L will be 

dX__dH 

with H given as a function of the constant elements. 

We thus shall introduce the Delaunay elements, always bearing in mind 
the rule that in the limit for /i 0 we must recover what was already been 
established for the two-body problem. Furthermore, it must be emphasized 
that, since the restricted problem is by definition a planar problem, one is 
still confined to the elements (^, L, G). 

The Delaunay Elements in the Rotating System 

Recall Hamiltonian (4.61), which can be rewritten as 

= +pI) - {(I 1 P 2 - q 2 Pi) - U {qi, q 2 ) , (4.73) 



Among others, N. Moisseiev: Sur Thypothese de Gylden-Moulton de Porigine de 
“Gegenschein”. Partie 5 et Partie 6, Astron. J. Sov. Union 15 , 217-225, 226-231 
(1938). 
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by omitting the constant term —JI (1 —Jl) /2, introduced for the sake of con- 
venience in the definition of the function and, in any case, irrelevant to 
subsequent developments. So far, we have worked with rectangular coordi- 
nates (such as Qi and ^ 2 ); we shall now use polar coordinates (while still 
remaining in the synodic system) . The relevant canonical transformation (see 
also Sect. 5.2) will give in our case (t? = tt/2) 

qi = r cos(y9, Q 2 — 'f' sin(/?, (4.74) 

Pr = Pi cos(y9-hp2 sin(p, = -rpi sin(/? + rp2 cosp. (4.75) 

From now onwards we will denote the canonical coordinates Qi — r, Q 2 = 
Pi = p^^ P 2 = p^. The new Hamiltonian will be therefore: 

lC = n = n{r,ip,pr,p^) = ^(pI + ^pI) -P^-U (4.76) 



with _ _ 

TTr ^ ^ P' 

U {r,(p) = H , 

Qi Q2 



Qi = \l{r cos 


ip + + (r sintp)^, 


^2 = \/(^ COS (y9 + /i — 1)^ + (r sin ( 


The equations of motion are consequently 








~ dPi 




r=Pr, 


A dn 




1 








A d'H 




pI dU 


- ncr 
dQi 




Pr = -j + 


■ dn 




dU 


P 2 = 

dQ2 







From the second of equations (4.77),^^ one obtains 



Pv = (< 7 > + 1) = ^ (<p + i) . 



The angle 'ip = ip + t is the angle with respect to the inertial system; hence 
Ip. It now only remains to perform the last canonical transformation 
to obtain the Delaunay elements. Let us call the new canonical variables 
ql, p[ and let W 2 = W 2 {Qi, p() be the generating function. Then 



Pi = 



dW2 

dQi 



Qi = 



dW2 

dp'. 



(4.78) 



^^If we were not in the dimensionless system, then we would have ((^ -h tot). 
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Keeping in mind our purpose, we impose the condition 

P^ = P2-P2^ (4.79) 

From (4.78) and (4.79), one obtains P 2 = p '2 — dW 2 jdip, and integration 
thus yields W 2 — P 2 PP f{Qi'>PiTP 2 )^ with / an arbitrary function. Moreover, 
again from (4.78) 

= (4.80) 

In the particular case ^ = 0 (two-body problem), the energy integral yields 
= 2/r — 1/a. In the synodic system, 

+ {rTp)^ , r‘^jp c = -y/a{l - e^), 

so that + a (1 — e^) /r^ and therefore 

V r a 

We shall rewrite this last relation, in general (with a and e variable), as 



In (4.81) use has been made of (4.79); moreover we have put p[ = ^/a. From 
(4.80) and (4.81), 

a/ \ . 2 T 



dQx V Qi vT 



By integrating this we get 



+ It ■ +9iPl,P2) , 



where ^ is a new arbitrary function. Now 



VF2(Qi, Q 2 , pi, P 2 ) =P2Q2 + y Y ^ (4.82) 

So far we have made use of the first two equations of (4.78). In fact we 
have conjugate momenta P 2 = P 2 = P^ P\ given by (4.81). We must 
now utilize the other two equations, that is, determine q[,q 2 - If we call the 
indefinite integral in (4.82) 7, we can also write 



Qi r r.'2 



^ J_ 
e p? 
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for arbitrary z . The variables still to be determined are 

, dW 2 dl ^ dg , dW 2 _ dl dg 

" dp[ ~ dp[ ^ dp[ ’ • 

We must then find the function g {p[,P 2 )] it is easier, and equivalent, to give 
the lower limit 2 : of the integral / as a function of p[ and p '2 and to put 
g (p'l, p' 2 ) = 0. What remains then is W 2 = P 2 Q 2 + 1, with the lower limit of 
I to be determined. One takes 

2 == p'l (pi - -p' 2 ^') - 

which, since p'l = y/a and p' 2 ^ = a(l — e^), means that z = a(l — e) (distance 
of the pericentre). Evaluating the integral and then differentiating gives 



q'l = arccos 



1 - 



Qi 

t 2 

Vi 



1 - 



q'^ = Q 2 — arccos 



/ 2 

Qi 



p'l 



2 -, - 1/2 



Qi 

Ti 



1 - 



p'l 



2-1 -1/2 



2 p'2^ 1 

Qi Ql p[^ 

(4.83) 



We now have to interpret the new variables. If in the first of equations (4.83) 
we substitute the explicit forms of p'l , p '2 and r we have 



q[ — arccos 



a — r 
ae 



rr 




On the other hand, when a and e are constant (two-body problem), 



—= — e sinu (4.84) 

Va 

\ 

is valid; therefore we can write the above expression for the coordinate q'l 
in the form r ^ a [1 - e cos{q'i + e sinw)]. By substituting (2.50), we finally 
obtain u — q'i~\-e sinw, that is q'^ = u — e sinu, which is nothing other than 
Kepler’s equation for the mean anomaly. Hence 

q[ = t (4.85) 



From the second of equations (4.83), with suitable substitutions, we obtain 



q'^ — ip — arccos 



a (1 — e^) 
re 



r 

e 



^^Differentiating Kepler’s equation yields i = n — u{l — e cosu) = ur/a. As the 
mean motion is n = we get u — \jry/a\ differentiating then the relation 

r = a(l — e cosu), so that r = aeii sinu, gives (4.84). 
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from which 

a (1 — e^) 

r = ^ . 

1 + e cos ((/9 — q' 2 ) 

By comparing this with the equation of the Keplerian orbit (2.21), one has 

^2 = ^ - /• (4.86) 

The second of the new canonical variables is then the argument of the peri- 
centre with respect to the synodic system measured from the qi axis (see 
Fig. 4.8). 




To summarize, the generating function of the canonical transformation 
from polar coordinates to the Delaunay variables is given by 



rQi 

W2 = Q2P2 + / 



P 2 



nl/2 



The transformation is from 

(Qi=r, 

\Q 2 =<f, 

to 

f Qi = 



/^]L 

Pi = r, 

p'l = Va, 



P?i 



d^. 



i 92 = </? - /> P 2 = A/a(l - e2). 
Conventionally, the following symbols are used: 



q[=l, p'l^L, 

92 = 9 , P2=G. 



(4.87) 



We now have to calculate the new Hamiltonian and to write the canoni- 
cal equations. By substituting (4.81) into (4.76), one has, upon calling the 
Hamiltonian in the new variables H, 
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2 

I ^ /9 ' 



01 ' Qi p? ' Qi\ ^ 2 



2 

Qi 






-P2-U. 



Thus 



- I — ^ 11 1 /I 1 

C7 = E + 

^2 V Q2 Ql 



when /x = 0, U = l/pi = l/r = l/Qi,so that 

« 2 pf P2- 



For /i ^ 0 , we write 

where i?, the difference between 1/r and the actual potential [/, is the per- 
turbing function of classical planetary theory. The Hamiltonian for the re- 
stricted three-body problem can then be written 



a- ^ ^ v' 



R 



1 



G-R, 



and the canonical equations are 



&H 


(M 1 


OR 


. / 


dn 


dL 


dR 




^ dt~ 




Pi = 


dq[ 


— — 
dt 


~ de' 


dn 


dg 


OR 


. / 


dii 


dG 


_ dR 


dp'2 


^ = -I 
dt 




P 2 = 


dq'2 


^ dt 


dg 



(4.88) 



(4.89) 



It is easy to check that for R = 0 (two-body problem) the first of equations 
(4.89) will again give us (2.84a) (that is, Kepler’s third law), whilst the third 
and fourth show that a and e remain constant; from the second of these 
equations, we see that g — —1^ that is, g — —t + const., and then (f — f = 
—t + const. This means that the apse line rotates in the (rotating) synodic 
system with unit angular velocity with retrograde motion, and it is then fixed 
with respect to the inertial system. It is therefore verified that (4.89), in the 
limit for 0, again give the equations of Keplerian motion. 



The Delaunay Elements in the Inertial System 

Let us now determine the Delaunay elements for the inertial system. We shall 
call the canonical variables in this system Q-, Pi and the new Hamiltonian 
R; because of what we said above regarding the limit of (4.89) for i? — )► 0, 
it must be the case that, while q '2 = g goes into Q 2 = g (with g = g 
the other elements are left invariant. Then the canonical transformation to 
be performed will be different from the identical transformation only as far 
as q '2 is concerned. In general, 
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W2 = W2 (q[,q^;Pi,P2,t), 



n = n+ 



dW2 
dt ’ 



Pi = 



dW2 

dq[ 



^ dW2 

dPi ■ 



(4.90) 



Taking into account that Q^ = g = g + t = q' 2 -\-t,hy integrating the last of 
equations (4.90) for f = 2 one obtains 

^2 = (?2 + ^) -^2 + V’ (Qi) ?2) -Pi) ^) ) 

with Ip an arbitrary function. Similarly, from Qj = £ — q[ , one obtains 

dW2 dtp 

dPi dPi ■ 

Integration then yields 

V’ (^i, Q 2 > ^ 1 - t) =q[Pi+x {q[, q' 2 , t), 

with X ^ arbitrary function. In summary, 

^2 = (^2 + P 2 + QlPl + X {Qii Q2^ i) • 



Since also 

dx 

dq[^ 

dx 

dq'^ ^ ■ dq'^ 

one sees immediately that x can only be a function of t] let us call this 
function ^{t). Hence 



^2 ^2 ^ , 2 + o^/ 5 



W2=^{q'2+t) P2 + q[ Pi+7(t). 



(4.91a) 



Furthermore, the Hamiltonian H must not be time dependent. From (4.90) 
and (4.91a), we obtain 



P 2 P 



dl jt) ^ 
dt ’ 



the most obvious choice is then 7 = const. , since the other possible choice 
7 = const. X t would give an additive and not essential constant in the 
Hamiltonian. Thus 



W 2 — {q '2 P P 2 P q'l Pi P (4.91b) 

with a an arbitrary constant (an identical transformation would in this case 
have been W 2 = q[ Pi P q '2 P 2 )- Using (4.88) the Hamiltonian is 

n = n + P2 = — K-r = -~-r. (4.92) 

2 Pj 
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For — > 0, (4.92) gives, as it should, the very same 1~L as in Sect. 2.5 (without 

the factor /i^ because of the particular system of units). 

Let us add at this point the consideration that, besides the elements 
(^, g, L, G), various combinations of the Delaunay variables are employed 
according to the case considered, including the exchange of the coordinates 
with the conjugate momenta. We list some of the combinations more fre- 
quently used: 



1) qi — Q2 — G ] 

2) qi = L, Q2 = G - L ] 

3) qi — q2 = L — G] 

4) qi = L — G^ q2 = G ] 



Pi^ P2= 9^ = %] 

Pi=i + g, P2 = g; 

Pi = e-\- g^ P2 = -g] 
Pi = P2 ^ ^ + 



By applying to set (4) the transformation having as a generating function 

p2 p2 

W4 = W4 {pi, Pi) = ~Y cot Pi + - cotp 2 

one obtains the Poincare canonical variables: 



Qi = V2 (L - G) cos£. Pi = V2 (L - G) sin£, 

Q 2 = \/2 G cos(£ + p), P 2 = a/2 G sm{i + g). 



By applying instead to set (3) the transformation having the generating func- 
tion 

Ws = Ws {Qi, Pi) = -Qipi - 2 ^2 tanp2, 

one obtains the following variables (half Delaunay and half Poincare): 



Qi — L, Pi — i -\- 

Q 2 = x/2 (L - G) cosg, P 2 = -a/2 {L - G) sing. 



4.6 The Regularization 

of the Restricted Three-Body Problem 

We now deal with the regularization of the system of order four relevant to 
the restricted problem. We shall formulate the problem in a general manner, 
by extending to non-natural systems the considerations of Sect. 2.6. In this 
way, even if at the end we shall carry out the regularization by means of the 
Levi-Civita transformation, the method can in any case be applied to any 
other regularizing transformation. 

Without choosing dimensionless units, we consider, to maintain full gen- 
erality, (4.48), which we rewrite as follows 
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.. O • 2 dU 

qi-2ujq2-uj qi = — , 
dqi 

.. o . 2 dU 

q2 + 2ujqi-uj q2 = — , 

dq2 

having put x — qi, y — q 2 . In Lagrangian form (4.93) become 
d dC dC 



(4.93) 



dt dqk dqk 



= 0 , 



k = 1 , 2 , 



(4.94) 



with 



^ — T U — - [ql ql) -u;‘^ (g^ T ^ 2 ) + {QiQ2 ~ Q2Q1) + U, (4.95) 

from which it appears that C contains the terms we called To{q) and 7i(g, q) 
in Sect. 1.5. From (4.95), one obviously obtains the momenta Pi — qi — u)q 2 
and P 2 = ^2 + and the Hamiltonian 



'^= ^{pi+pI) {(I1P2 - q 2 Pi) - U, 



(4.96) 



which, if we put a; = 1, coincides with (4.73). Let us now consider, as in 
Sect. 2.6, the complex variable z = qi+i q 2 and the transformation z = f{w) 
with w = Qi + i Q 2 . li we impose the validity of 



dqi dq 2 



dqi 



dq2 



dQi 0Q2 ’ dQ2 dQi ’ 



(4.97) 



then z = f{w) will be a conformal transformation for every point where 
f'{w) ^ 0. The Jacobian of the transformation will be given by 



T ^ d{qi,q2) ^ , 

d{QuQ2) \dQj 
By means of (4.97) and 



dqi 

0Q2 



_d 

dt 



Qi 



d 



Q 2 



dq2 

dQi 



d 



+ (^) = 
\dQ2J 



df 



dw I 
(4.98) 



dQi ' '"^dQ2^ 

the expression for the kinetic energy becomes 



(4.99) 



Furthermore, 



dT 



dq2 



dqi \ 
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d 9T d - ■ 

= -t:{J Q i) -ujJQ2 



dtdQi 



dt' 












+ W Q 2 ( Ql 



dJ 

dQi 

d^q2 



+ loQi I q'l 



Q2J —Q2 

d‘^q2 



92 



dQ\ 

d'^qi 



dQ\^^^'dQidQ2 
d^qi \ , .A 



dQi dQ2 dQi dQ2 

Analogously, one can obtain expressions for 

dT d dT or 



d 

dQi 



+ q|) 



dQ2^ 



dt 0Q2 ’ 9 Q 2 ’ 



which we do not state for the sake of brevity. 

At this point, we can rewrite Lagrange’s equations in the new variables; 
they will be 

t=l,2, (4.100) 

dtdQk dQk ^ ^ 

where C = T-\-U is written using for T expression (4.99) and for U the ex- 
pression obtained by substituting for qi = qi (Qi, Q2) and q2 = q2 (Qi, ^2)- 
By exploiting the derivatives evaluated as above, we get 

d dT dT dU 



from which 
d 



JJQ,)-2.JQ,~ 



dtdQi 

d 



dQi dQi 



^UJ^{qi+ql) 






dJ dU 



dQi 



and also 
d 



Ql) — 2 uj J Q2 = - 



(^qI + Q 2 ) 



dJ d 
+ 



{Qi + ^ 2 ) + ^ 



- {q\ + ^ 2 ) + ^ ^ 



dQi dQi 

and lastly, h being constant, 

|(JQi) -2ujJQ2 = \{qI + QI)^^+~ 

Since, from 

'^ = \{qi+ ql) - ( 9 ? + ql)~u = h, 

one also has 

-ixP' (91 + 92) + C + ^ = 2 ^2) — 2^ (qI + Q2) , 

we can finally write 
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§(JQ,)-2.JQ, 



- u? [q\ + + C/ + /i 



By means of analogous calculations, it can also be shown that 
d 



dt 



{JQ2) + 2loJQi \^^{qf + ql)+U + h 



(4.101a) 



(4.102a) 



If, as in (2.106), we put dt = Jdr, (4.101a) and (4.102a) become 



d^Qi „ jdQ2 



d^Q2 

dr'^ 



2jUJ J 



dQi 

dr 



d 

dQi 

d 

dQ2 



J 



J 



2 (qi + qi) + (d' + h 
(qi+ql) +l7 + h 



j, (4.101b) 
j. (4.102b) 



Equations (4.101b) and (4.102b) are the new equations of motion in place of 
(4.93). They are then the result of two transformations: the first is a conformal 
transformation and contains, so to speak, the geometric information about 
the orbit transformation; the second, which, as we have seen in the two- 
body problem, is the one that is essential for the regularization, controls the 
dynamics of the problem. 

If we put 



J 




(Qi + ^2) + 



= 12, 



(4.103) 



we can also rewrite the equations of motion as 



d^Qi jdQ2 _ dQ 

dr‘^ ^ dr dQi ’ 

(PQ2 r, _ df2 

dr‘^ ^ dr dQ 2 



(4.104) 



From these, one can immediately determine the first integral corresponding 
to the original Jacobi integral (note that the constant h for the Hamiltonian 
used in this section corresponds to —C/2 -\-]l {1 — Jl )/2 in the dimensionless 
system and with the definition of the potential given in Sect. 4.3). In fact, 
from 






dr J \ dr J 



and from (4.103), 



dr J 



dQ2 

dr 



2 

= 212 . 



(4.105) 



So that (4.104) turn out to be regularized, it will of course be neces- 
sary for 12 to be without singularities. We have already seen that the Levi- 
Civita transformation completely regularizes the two-body problem, making 
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the product JU a constant in the case where the denominator of U van- 
ishes in the origin, where the centre of attraction is located. In the restricted 
problem, things are different, because U consists of two fractions, and conse- 
quentely there will be two possible singularities, corresponding to collisions 
with each of the two primaries separately. By means of the Levi-Civita trans- 
formation we can therefore regularize either singularity, choosing which pri- 
mary we should deal with first. This will be then a local regularization (the 
term is due to Birkhoff). Furthermore, a translation should be performed as 
well, because the primaries are not located at the origin of the coordinate 
system. From these considerations it seems that if we want to regularize the 
equations in view of a collision with the primary of mass mi and coordinates 
(xi, 0), we must put: 

qi~xi=Q\-Ql, q2 = 2Q, Q 2 . (4.106) 

The corresponding term in U will be 

Gmi 

W+W)' 

and multiplying this by J = 4 (Qf -h will give a constant. 

In the case where one of the primaries has a mass much greater than 
the other one (e.g. Sun- Jupiter-asteroid), by regularizing the term of the 
potential relevant to this primary a la Levi-Civita, one is able to schematize 
the problem as a (regularized) two-body problem perturbed by the presence 
of the smaller primary. One can therefore end up regarding the problem as a 
problem of a perturbated planar oscillator. 

It is clear that, to handle a real problem, that is, in the case of a numerical 
solution of a system which refers to a definite physical example, Levi-Civita 
regularization completely fulfils the requirements. To answer, instead, the 
question concerning the existence of the solution, global regularization is more 
appropriate, that is, the use of a transformation which cancels at the same 
time both singularities of the potential U. Regularizations of this sort have 
been suggested by various authors (Birkhoff, Thiele, Lemaitre, Arenstorf, 
etc.). They are, as one could expect, more complicated transformations whose 
explanation can turn out to be very long. We therefore refer the reader, in 
addition to the celebrated paper by Birkhoff, to the textbooks by Szebehely 
and Hagihara.^^ 



^^See, for instance, D. A. Pierce: A solution of the regularized equations of motion 
of the restricted problem, Astron. J. 71 , 545-561 (1966); Application of a solution 
of the regularized equations of motion of the restricted problem, ibid. 71 , 562-565 
(1966). 

D. Birkhoff: The restricted problem of three bodies. Rend. Circ. Mat. Palermo 
39 , 265-334 (1915). 

Szebehely: op.cit.. Chap. 3; Y. Hagihara: Celestial Mechanics (Japan Society 
for the Promotion of Science, 1975), Vol. IV, Chap. 17. 
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4.7 Extensions and Generalizations 
of the Restricted Problem 



A great many of the bodies in the solar system have a very small inclination 
with respect to the plane of the ecliptic and in most cases the eccentricity of 
their elliptic orbits is also very small; the restricted problem can be regarded, 
therefore, in a first approximation, as a mathematical model admitting of real 
applications. Systems such as Sun-Earth-Moon or Sun- Jupiter-asteroid, can 
be handled successfully as physical realizations of the restricted three-body 
problem. Obviously, to make more accurate forecasts, one must take into 
account that the motion of the third body does not occur precisely in the 
plane where the primaries move and moreover that the orbits of the primaries 
are not exactly circular. For these reasons, although at first sight it may look 
inconsistent, it is more appropriate to consider certain problems as extensions 
and generalizations of the restricted problem, rather than as particular cases 
of the general three-body problem. From what we have said, it turns out that 
the extensions we can make are of two kinds (which may even be present at 
the same time): 

a) to consider the primaries always moving in the same plane, but with orbits 
which are in general conics and not necessarily circles; 

b) to consider the third body moving in (three-dimensional) space and not 
confined to the plane where the primaries move. 

Proceeding step by step, let us deal first with the extension deriving only 
from point (b). 



The Three-Dimensional Restricted Problem 

In this case, (4.50) are replaced by 

where the function ^(x, z) is still given by (4.49), but with 

(7 = 1^ + ^, (4.108) 

^1 Q2 

where 

Qi = \j{x + Jif + y‘^ + z'^ , Q 2 = + + y^ + z‘^, 

now containing the coordinate 2 ;. For (4.107), the Jacobi integral still exists: 
2 ^ {x, y, z) - +iP + z^) — C. (4.109) 
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The Hamiltonian form is obtained by extending (4.61), (4.62) and taking as 
the Hamiltonian 

= +pI+pI) - {<hP2 - q2Pi) + ^ {ql + ql) 92, qs) ( 4 .iio) 



and adding to (4.62) 



Qs =P3, 



(4.111) 



with qz = z. 

For the Delaunay elements in three-dimensional space, we will still have 
Hamiltonian (4.92) in the inertial system, with R = R{1, g, h; L, G, H), where 



H — a {I — e^) cosi, h — fi. 



(4.112) 



Also the combinations of the Delaunay elements listed at the end of Sect. 4.5 
are generalized as follows 

2) qi=L, Q 2 ^G-L, q 3 =H-G, 

Pi = e + g + h, P2^ 9 + h, P3 = h; 

3) qi = L, q 2 — L — G, qs = G — H, 

Pi=i + g + h, P 2 = -(g + h), p3 = -h; 



4) qi=L-G, 
Pi = 



q2^G-H, 
P2^^ + g, 



qs=H, 

P3 = i + g + h. 



Lastly the Poincare canonical variables are extended to three-dimensional 
space through 



Qi = ^/2{L - G) cos£, 

Q 2 = V2 (G - H) cos{£ + g), 
Qs = H, 

and, analogously, the mixed ones are 



Pi = V2(L^^ sin^, 

P2 = V^ {G-H) sin(^T^), 
P^ — i P g 

Pi = i P g P h, 



Qi — T, Pi — i P g P 

Q 2 = \/2 (L - G) cos {gPh) , P 2 = ~ \/2 (L - G) sin {g P h) , 



Qs = y/2 (G - if) cos h, 



Ps = -^/2{G-H) sin h. 



If now, as in Sect. 4.3, we consider the manifold corresponding to the states 
of motion of the system, its singular points will be given by the equations 
X = y = z = 0, d^jdx = d^jdy = d^Pfdz = 0. The explicit expression of 
the z component of the gradient is 
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from which, the quantity in square brackets certainly being positive when 
P 3 is at a finite distance, one deduces that z = 0. Therefore, the equilib- 
rium points still lie in the xy plane: the Lagrangian points will have the 
same positions Li, L 25 ^ 3 , ^ 4 , ^5 as in the planar case. However, an essential 
difference exists regarding the stability of the equilateral solution. In the pla- 
nar problem, points L 4 and L 5 represent positions of stable equilibrium; by 
approximating a physical system (Sun- Jupiter-asteroid) to a planar system 
and taking into account the stability of the equilateral solution, we concluded 
that the physical system itself was stable. At this point, one might think that 
this conclusion is not valid, owing to the approximation. However, as Little- 
wood did in the case of the planar problem, one should instead ask whether 
the system is stable on a finite time scale, not for eternity. The answer is that 
the equilateral solution in the spatial problem is stable for a time which may 
be of the order of the age of the solar system. One then speaks of effective 
stability. 

Now, the locus of the zero velocity points is defined by 

${x, y, z) = ^-, (4.113) 

it therefore consists of a family of surfaces embedded in three-dimensional 
space. Since both y and z appear only as squares in (4.113), there will be 
symmetry with respect to the xy and xz planes. The intersection with the 
xy plane will give rise to the zero velocity curves we have studied in Sect. 4.3. 
We saw there that, in the xy plane, the function ^ (x, y) attained its mini- 
mum value for (7 = 3. Now instead there are zero velocity surfaces for C < 3 
which do not intersect the xy plane. For a detailed study, we refer the reader 
to the textbook^^ by Szebehely. Among the possible orbits in the restricted 
three-dimensional problem, there is a remarkable particular case:^^ the case 
of (one-dimensional) motions along the z axis. The importance of this case 
resides in the fact that, as we shall see, the system becomes equivalent to a 
real system with only one degree of freedom. It is the only exact solution, 
besides the equilibrium configurations, of the restricted three-body problem. 
The extension to the case where the primaries move on elliptic orbits then 
gives rise to the so-called Sitnikov problem^ which we have already mentioned 
(Sect. 3.5, Footnote 37). First of all, let us see what are the conditions to be 
satisfied for the motion to occur along the z axis. From (4.107), by imposing 
the condition x{t) — y{t) = 0, Vt, we have 

^^See, for instance, A. Celletti, A. Giorgilli: On the stability of the Lagrangian points 
in the spatial restricted problem of three bodies, Celestial Mech. and Dyn. Astron. 
50, 31-58 (1991). 

Szebehely: op. cit., Chap. 10. 

Pavanini: Sopra una nuova categoria di soluzioni periodiche nel problema dei 
tre corpi, Ann. Mat. Serie III, 13, 179-202 (1907); W. D. Mac Millan: An inte- 
grable case in the restricted problem of three bodies, Astron. J. 27, 11-13 (1911). 
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dx ' dy ' dz ' 

From the first of these, we obtain ^i(0, 0,z) — ^ 2 ( 0505^)7 ^nd, as a conse- 
quence, 77 = 1/2. Hence, the masses of the two primaries must be equal 
and therefore also at the same distance from the origin where the barycentre 
is located. The circular orbits of the two primaries will then coincide (see 
Fig. 4.9). 




In our dimensionless system, the distance of the masses from the origin 
will also be equal to 1/2, so that the third of equations (4.114) is reduced to 







(4.115) 



which is the equation of motion of a system that has only one degree of 
freedom. For the Jacobi integral, we have 



i' 



^2^-C = 




(4.116a) 



with C = (7 — 1/4. Equation (4.116a) can be rewritten in the form 

r-^ = E, (4.116b) 



with E = —(7/2; it is evident from (4.116b) that E represents the total 
energy of the third body. Let us now see what information (4.116b) can 
provide. For E’ > 0, the right-hand side of (4.116b) never vanishes, and then 
P 3 will always move in the same direction, depending on the initial velocity. 
In any case, it will move indefinitely far away from the initial position at the 
time t = 0. Therefore, positive energy values cannot correspond to bounded 
motions (see Jacobi’s stability criterion. Sect. 3.5). Since the potential energy 
term has values between 0 and 2, for E < —2, (4.116b) can never be satisfied 
by real values of 2: and i. 
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If we consider — 2 < E’ < 0, two values of z exist, which we shall call zq 
and —Zq, with 

_ V4-E2 
“ 2 \E\ ’ 



giving i = 0. These two values for z correspond to the maximum and min- 
imum values of the function z{t) for —2 < E < 0, as can be seen from the 
sign of the second derivative, z{zq) = —zq \E\^. Between —zq and zq, the 
function z{t) is always increasing or decreasing, and so the motion is inverted 
in the extreme positions only. Moreover, for E = 0, one has that Zq and —zq 
equal +oo and — oc respectively (for z ^ oo, z 0 and the motion is of 
parabolic type; see the table at the end of Sect. 3.5). 

For E = —2, the extremes reduce to zero; then, in this case, the barycentre 
is a position of stable equilibrium for P 3 . Owing to what we have said, the 
motion on the segment [—Zq, Zq] turns out to be periodic, and the period 
will be given by 



T = 2 




dz 



To evaluate T, it is convenient to carry out a change of variable, by defining 



1 _ 1 



u = 1 corresponds to z = 0 and u = \E\/2 corresponds to z = zq. For T we 
have then the expression 



T = 



L 



\E\/2 



du 

v/(l - uf {u EeJ^ 



(4.117) 



A further change of variable, with 



1 — u 2 It E j2 

^ ITP/2’ ^ 2 



then provides us with the possibility of transforming (4.117) into a complete 
elliptic integral in Legendre form of the third kind. The series expansion of 
the result is given by 



The solution of the problem can be obtained only by inverting the elliptic 
integral corresponding to 



t - to 




du 

4^2 ^(\-uY{uEEI2) 
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and then expressing the variable z as a function of u obtained by means 
of elliptic functions. From (4.118), by making E tend to the limiting value 
—2 (and thus k to zero), one has T = Therefore, when the motion 

degenerates into an equilibrium position, the period tends to the finite limit 
7 t/>/2 . The same result is obtained by linearizing the equation of motion 
(4.115); in fact, if we consider motions occurring near the origin by putting 
2; = C (C is a small quantity), we obtain, neglecting second-order terms 

C + 8C = 0, 

that is the equation of an oscillator with period 7 t/\/2. 

Furthermore, it has been shown by Pavanini^^ that, assuming for the pri- 
maries masses which differ by a small quantity, one still obtains, for energies 
not very different from E = —2, periodic solutions; they consist of closed 
orbits no longer confined to a segment of the 2: axis, although very close to 
it. 



Regularization of the Three-Dimensional Restricted Problem 



Let us now see what kind of difficulty arises when the study of the regulariza- 
tion of the planar case (Sect. 4.6) is extended to the three-dimensional case. 
We have already mentioned (both in Sect. 2.6 and in Sect. 4.2) that the Levi- 
Civita transformation cannot be extended to the three-dimensional case. It 
has been demonstrated^^ that a matrix endowed with the properties of the 
matrix L{u) of Sect. 2.6 can be only 2 x 2, 4 x 4 or 8x8. Kustaanheimo^^ 
worked out the idea, by analogy with spinor theory, of considering four dimen- 
sions by using column vectors with the fourth row equal to zero. If qs 

are the (Cartesian) coordinates of the third body (m3 ~ 0), we introduce 
the four parameters ui,U 2 ,us,U 4 , three of which will be the new coordinates. 
We then define the K-S matrix: 



K{u) = 



Ui 


-U2 


-U3 


U4 \ 


U2 


Ui 


—U4 


-U3 


U3 


U4 


Ui 


U2 


U4 


-U3 


U2 


— Ui / 



and the relevant transformation 



/Qi\ 
Q2 
qz ^ 

Vo/ 



Ui 


-U2 


-U3 


U4 


\ 




/Ui 


U2 


Ui 


—U4 


-U3 






U2 


U3 


U4 


Ui 


U2 


/ 




U3 


U4 


-U3 


U2 


-Ui 


\U 4 



(4.119) 



(4.120a) 



Pavanini: loc. cit., 2nd part. 

^°The result, later on generalized by Jordan, Von Neumann and Wigner, is due to 
A. Hurwitz: Mathematische Werke II, (Birkhauser, Basel, 1933), pp. 565-571. 
^^See P. Kustaanheimo, E. Stiefel: Perturbation theory of Kepler motion based on 
spinor regularization, J. reine angew. Math. 218 , 204-219 (1965). 
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In compact form this is 

q = K{u) u. 

In terms of components, we have 

qi=u\-u%-u\ + u\, 
Q 2 = 2 (ui U 2 - U3 U4) , 

93 = 2 (ui «3 + U 2 U4) , 

94 = 0. 



(4.120b) 



(4.120c) 



As for matrix (4.119), the orthogonality property still applies 

(u) K {u) = \uf 1 , K~^ (u) = K'^ (u) . 

Moreover, properties (b) and (c) of Sect. 2.6 still apply, whereas the general- 
ization of (2.97) requires a little more care. If we generalize it by writing its 
differential, we shall have: 



dq — 2K (u) du. (4.121) 

The three equations obtained from the first three components of (4.121) give 
us total derivatives, which, upon integration, give (4.120c); the fourth, on the 
other hand, unlike (4.120a), does not give an identity but an equation: 

u^dui — Usd U 2 + U 2 d Us — u\du^ = 0, (4.122) 

which must be satisfied by the differentials of u. Consequently, whilst the 
transformation ix — > g is given by (4.120a), when the inverse transformation 
q u is performed, (4.122) must also be satisfied. 

From (4.120c), by squaring, adding and taking the square root, one gets 

u\+ul + u\ + ul = ^Jq\ + ql + ql=r, (4.123) 

where r is the radial distance in the three-dimensional “physical” space. Also 

dql + dql + dql = Ar [d uf d u^ d ul d u^) , 

if (4.122) applies. The independent variable is introduced afterwards in the 
usual way by writing dr — dt/r. 

Let us now see how the transformation works. From (4.120c), one can 
immediately see that the points of the U\U 2 plane of the parametric space 
(which we shall call U^) are transformed into points of the q\,q 2 plane of the 
physical space following the Levi-Civita relations (2.94a). Furthermore, if 
a point u of is transformed into q of M^, every point v which satisfies 



vi = Ui cos (f — U 4 sm 
V 4 = ui sin (p -\-U 4 cos (/?, 



V2 = U2 COS (p ^Us sin (p, 
Vs = —U2 sin (p -^us cos (p, 



(4.124) 




4.7 Extensions and Generalizations of the Restricted Problem 291 



where Lp is an arbitrary angle, is also transformed into the same point q. 
The image of a point in is therefore a circle with radius ^/r in the 
parametric space U^. The tangent vector to this circle, which is obtained by 
differentiating (4.124) with respect to (p and evaluating the result at 99 = 0, 
has as components — 1 ^ 4 , 1 ^ 3 , —U 2 ^ ui. Therefore, (4.122) commands the vector 
du originating from the point u to be orthogonal to the circle with radius 
^/r passing through that point. This circle is said to be a fiber of and 
(4.122) defines the fibration of the parametric space Two different fibres 
never intersect, because they correspond to two different points of R^. 

If we now extend (4.122) to any two vectors u, v which define a plane 
R^ in that is, we require 



U 4 V 1 - UsV 2 + U 2 V 3 - U 1 V 4 = 0, (4.125) 

it can be demonstrated that the plane R^ is conformally transformed into a 
plane of R^, and moreover the transformation is of the Levi-Civita type,^^ 
meaning that it doubles the angles at the origin and squares the distances. 
Let us now look at the inverse transformation. 

From the first of equations (4.120c) and from (4.123), 

ul + ul = ^{qi+r). (4.126) 



Obtaining U 2 and U 3 from the last two equations of (4.120c) and by substi- 
tuting (4.126) into them, one gets 



U2 



Q2 ui + qs U 4 
qi+r 



U 3 



qs ui - Q2 U4 
qi+r 



(4.127a) 



These relations are convenient for the case when qi >0. For the case where 
qi, on the other hand, is negative, the following are convenient: 






ui = 



Q2 U2 + qs U3 
r-qi 



U 4 = 



qs U2 - q2 Us 
r -qi 



(4.127b) 



The Elliptic Restricted Problem in the Plane and in the Space 

We shall confine ourselves here to a short outline. The planar elliptic prob- 
lem, with respect to the circular one, shows a fundamental difference: the 
Jacobi integral no longer exists. The same occurs, obviously, for the spatial 
elliptic problem. The Lagrangian solutions, however, remain, as one would 
expect from the study of the general three-body problem. As the orbits of 
the primaries are elliptic, the rotating system which one has to refer to, in 
order to maintain the primaries in fixed positions, must be a system which 
rotates uniformly and with axes which expand and shrink. We shall not en- 
ter into the details of the calculations and we only remark that, in order to 

^^See E. L. Stiefel, G. Scheifele: op. cit.. Chap. XL 
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have dimensionless variables, the distance of the primaries, as in ( 2 . 21 ) with 
( 2 . 22 b), 



r{t) ^ 



a (l — e^) 

1 + e cos / ’ 



(4.128) 



will be used as unit lenght (a and e are the semimajor axis and the eccentricity 
of the ellipse that each primary describes around the barycentre and / is the 
true anomaly of that orbit). A particular case which has drawn considerable 
interest is Sitnikov’s problem, (see Fig. 4.10), as has already been mentioned. 




The two masses mi — m 2 move along elliptic orbits around the barycen- 
tre located at the origin of the reference system. The third body, with mass 
m 3 ~ 0 , moves along the straight line (z axis) orthogonal to the plane of 
motion of Pi and P 2 and passing through the barycentre. The equation of 
motion along the z axis is 

(P z _ z 

^ ~ ~ [z2 + r2 {t)f^ ’ 

with r{f{t)) given by (4.128). The Hamiltonian 

P = - 2 1 

2 ^ y/r‘^ (t) + z^’ 

with p = z, is a function of the time and, as we have already said, no 
global integral exists. Sitnikov demonstrated the existence of solutions that 
are unbounded but pass an infinite number of times through the origin. The 
problem has also been studied by Moser,^^ who demonstrated the existence 



Moser: Stable and Random Motions in Dynamical Systems, (Princeton Univer- 
sity Press, 1973), Chap. Ill, Sect. 5. 
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of chaotic orbits, and by Wodnar,^"^ who confirmed Sitnikov’s results through 
a more rigorous mathematical analysis. 



Hill’s Problem 



We mentioned at the beginning of this section the Sun-Earth-Moon system 
as an example of the application of the restricted problem. We shall now try 
to explain, in broad outline, that particular version of the restricted problem 
known as Hill’s problem and which is the basis of Hill’s lunar theory,^^ later 
improved by Brown and others. 

The starting point consists in the equations of the restricted problem, 
(4.50) and (4.51). This means that one assumes that the Moon’s inclination is 
equal to zero and the orbits of the Sun and the Earth around their barycentre 
are circular. In addition, let us translate the y axis, by putting the origin of 
the rotating system at the position occupied by the body of mass JI (in this 
case the Earth) . If we call the new coordinates of the third body (the Moon) 
X = X — {1 —Jl) and y = y, the function ^ will be given by 



^={1- y) x+- (x^ + ^) + 



1- y 



-h 






if 












up to the constant terms which from now on we shall delete since they are 
irrelevant. Let us further simplify our model by considering the Earth-Moon 
distance small with respect to the Earth-Sun distance: we shall eliminate 
from the equations of motion powers of x and y higher than one. This is the 
same as suppressing in the expansion of ^ powers greater than two. What 
remains is _ _ 

still up to constant terms. Furthermore, since the mass of the Earth is far 
smaller than the mass of the Sun, we can also neglect the first term and 
consider _ 

. (4.129) 

Equations (4.50), in the coordinates x and y, are now: 



x-2y 






y + 2x = 



dy' 



with ^ given by (4.129). If we operate a change of scale defined by 



X— >q;x, y ay ^ fi ^ j3 fi^ I — /i^/3(I — /x), (4.130) 



^^K.Wodnar: The original Sitnikov article - “New Insights”, Master Thesis, Uni- 
versity of Vienna, 1992. 

W. Hill: loc. cit., in Footnote 13, Sect. 4.3. 
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with 



a 






= 



1/3 



f3 = 



1 



(4.131) 



1-jij - 1-r 

and again use the symbols x and i/, we obtain just the system (4.50), with 
^ reduced to the form: 






-h 



1 






(4.132) 



The system has the Jacobi integral (4.51). 

If now we check what the relative equilibrium solutions for the problem 
schematized above are, we see that the equations 



— =3x y=0, — 

ox (x‘^-\-y‘^)^ 

admit as unique solutions the two points 



y 



(x2 + 2/2) 



3 

2 



= 0 



(4.133) 




at these points 

^ = ^crit = ^^. (4.134) 

The equilateral solution therefore no longer exists, and in addition one of 
the three points of the collinear solution (Li) has disappeared. This is a 
consequence of our having considered x and y small and Pi (the Sun) placed 
at an infinite distance (solar parallax equal to zero). A calculation similar to 
that performed in Sect. 4.4 then leads us to conclude that the two points A 
and B are positions of unstable equilibrium and the critical value (4.134) is 
neither a maximum nor a minimum. Hill’s region, that is, the region where 
the motion is allowed, defined by 



3x^ -\- 



2 

\/x2 + 2/2 



- c > 0, 



is symmetric with respect to the x and y axes. For C < 0 , it coincides with 
the whole xy plane. For C > 0, however Hill’s region is included between two 
asymptotes parallel to the y axis and with abscissae x — A: \fCj3 ; depending 
on whether C greater, equal or smaller than 3^/^ (= 2^crit), one then has 
the situations shown in Fig. 4.11, 4.12 and 4.13 respectively.^^ 

When C > 3^/^, the motion can occur either inside the oval around 
the origin or inside the two infinite regions included between the curves and 
the asymptotes. For the motion of the Moon, the region concerned is the oval 
around the origin for C > 3"^/^. Hill’s problem can be regularized by applying 
the Levi-Civita transformation, according to the method worked out in the 



^®For a detailed discussion, see V. Szebehely: op. cit.. Sect. 10.4. 
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previous section. One puts z = x iy^ w = u + iv, with z = f{w) defined 
by X — V? — and y = 2uv. For the independent variable 



dr = 



dt 



4 ’ 

The following equations are obtained: 



with 4 {u^ + = J = I/' (ic)p. 



+ 8 {u^+v^) u' = ^, 



(4.135) 



with 

i? = l/'l^ = 6 + 4 - 2 C . 

The integral which corresponds to the original Jacobi integral is given by 

i (m' 2 + u'2) - J? = 0. (4.136) 

From (4.135) and (4.136) it is easy to see how the singularity at the origin has 
been eliminated: i?(0, 0) = 4, ^2^t(0, 0) = i?^;(0, 0) = 0, and the velocity at the 
origin has modulus \/8, as can be obtained from the Jacobi integral (4.136). 
The extremely simple form attained by the regularized problem allows us to 
study the behaviour of the solution of (4.135) near (0,0), that is, near the 
Moon-Earth collision. 

If u{0) = r’(O) = 0 are the initial conditions for the coordinates at the 
time t = r = 0, since the modulus of the velocity at (0, 0) is \/8, we have 



i;'(0) = Vs sin a. 



tx'(O) = Vs cos a 
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where a is the angle of the velocity with respect to the x axis. From (4.135), 
ix"(0) = 0, r’"(0) = 0 and, by taking and calculating their derivatives at 
(0,0), we get 

7 x'"( 0 ) = —4C u'(0) = —4C \/8 cos a, 

'r’"'(0) = —4Cv^(0) = —4(7 a/s sin a. 



Hence 



u — 

V = 



(^Vs cos T — C \Zs cos a) + • • • 

^a/S sin ^ ~ ^ ^ ™ a) + • • • 



and also 

t = 4 f + i;^) dr = ^ (l - ^ (7 H ) . 

7o 3 5 

The solution xx(r), v{r) therefore admits an analytic continuation that is 
real and unique beyond r = 0. 

Hill’s lunar theory, radically changing the paradigm, which from Lagrange 
to Delaunay consisted in representing the orbit of the Moon as resulting 
from the perturbation of a Keplerian ellipse, assumes as its starting point a 
particularly simple periodic orbit. Going back to (4.131), (4.132), let us look 
for a periodic solution with initial conditions 



x(0)>0, 2/(0) =0, x(0) = 0, 2/(0) >0; 



that is, at the time t = 0 the Moon is in conjunction with the Sun and has, 
in the rotating system, a velocity perpendicular to the line joining the Earth 
with the Sun. Furthermore, the solution must have the symmetry properties 



x{t) = x{—t) 



x[t4- 



y(t) = -x{-t) = -y ( t + - 



where we have denoted the period by T — 2'kIuj. This particular solution is 
called variational orbit. In terms of Fourier series, we can write 



oo 

X = {An COS nut + Bn sin nut ) , 
n=0 
oo 



y 



= COS nut + Dn siunut ) , 



n=0 



and, because of the symmetries imposed, we also have = (7n = 0, Vn, 
and all the terms of even indices equal to zero, that is. 



X = Ai cos ut + As cos 3ut -)-•••, 
y = Di sin ut + Ds sin 3ut + • • • , 

which, by putting A 2 i-\-i = 4- a_i_i, ^ 2^+1 = ai — we can rewrite 

as 
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-= E 



ttn cos (2n + 1) Cc;t, y= ^ an sin (2 n + 1) 



The solution depends on the sole parameter lo , having previously eliminated 
Ji by means of a scale transformation. Usually one puts Ifuj — m (Hill’s 
parameter) and then: 



E , - / 2 77/ T 1 . 

an[m) cos ( — — t ) , 



n= — oo 
oo 

2/ = ^ an{m) sin 

n= — oo 



m 

2 77/ “h 1 
777/ 



(4.137) 



In the system that is not dimensionless, 



UJ = 



n — n 

77/' 



m — 



n — n' 



The observed value of n' /n (the ratio between the mean motion of the Sun 
around the barycentre of the Earth-Sun system and the mean motion of the 
Moon around the Earth) gives m = 0.0808489338... . 

By substituting (4.137) in the equation of motion (4.50), (4.132), one 
obtains a system of infinite non-linear algebraic equations for the infinite un- 
knowns given by the coefficients of series (4.137), depending on the parameter 
m. Hill demonstrated that this system admits a unique solution, at least for 
small values of m. The value m = 0.0808489338... is included in the allowed 
values. For the coefficients a^, one has the series expansions 



ao 



777/ 



2/3 






ai 


^ 2,1 3 I 


a_i 


19 2 




= — m + - m + ■ ■ • , 




= - 


ao 


16 2 


ao 


16 




25 4 


d-2 


4 . 




= 7^ m + • ■ • , 




= 77Z H . 


ao 


256 


ao 



The solutions corresponding to different values of ttT/, determined by Hill, 
are represented in Fig. 4.14. The first (inner) orbit corresponds to tti = 
0.0808489338... and the last (outer) one to m = 0.5609626.... The first is 
the only one that is completely contained in the zero velocity oval. As can be 
inferred from the above expansions for the coefficients, it has the equations 



Xq (t) = cos — , 
m 



7/0 (t) = m 



= sin — 



m 



which represent the motion of the Moon if the influence of the Sun is com- 
pletely neglected. The coefficient comes from third Kepler’s law. It is 

evident that, for small values of ttT/, the circular orbit is only slightly deformed 
by the presence of the Sun. 
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Actually, as we know, the real motion of the Moon is very complicated^^ 
because of the presence of various perturbations (including the effects of the 
Earth’s equatorial bulge). The lunar theory so far outlined concerns exclu- 
sively that which has been called by Brown the main problem^ that is, a 
theory which takes into account only the presence of the three bodies Earth, 
Moon and Sun, and, in addition, considers them to be point masses. However, 
in this case, the presence of the Sun also gives rise to motions of the apsides 
and nodes. 




We shall not deal with the last type of motion, but continue to consider 
the problem as a planar problem (otherwise one should consider a pertur- 
bation of an equation analogous to the third of equations (4.107), decoupled 
from the other ones). As to the motion of the perigee, we consider a displace- 
ment (a small first-order quantity) from the periodic orbit; calling the values 
corresponding to the periodic orbit itself xq and yo, we have 

X == xo + fe, 2/ = yo + Sy, 

and we shall neglect the squares of Sx and Sy. By varying (4.131), to first 
order we obtain 



^^A curious description of the motion of the Moon was written by Brown for the 
Encyclopedia Americana (1949 edition, Vol. 19, p. 425). The entire passage can 
be found in S. Sternberg: Celestial Mechanics, Part I, (W. A. Benjamin, 1969), 
pp. XIV-XVI. 
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5x 



2^6y 

dt ^ 



-S,j + 2-Sx = 



8^ 

dx“^ 

d^d> 



r f \ 
0 \dxdy) 



Sx + 



^dxdy 

and, by varying the Jacobi integral (4.51), (4.132) 
d 






Sy, 



^y, 



dt 



Sx + 



d , fd^ 

laa; 



Sx - I — 

oy 



Sy = 0, 



where (• • *) |o ^he derivatives evaluated on the periodic orbit. If, instead 

of the variations Sx and Sy, we consider the variations ST and SN in the 
tangent and the normal to the orbit respectively, given by 

ST = Sx cos^p + Sy sin-^, 

SN = —Sx sin^p + Sy cosxp 



{^ip is the angle between the tangent and the x axis), one succeeds, with some 
manipulation, in obtaining an equation like 

d^ 

— JTV + eJAT-O 
dt^ 

Equation (4.138) is Hill’s famous equation and one can show that the 
function 0 is an even periodic function with a period which is twice 27 t/u;.^^ 
We confine ourselves to mentioning the fact that, if we represent the function 
O by means of a series of cosines, the solution of (4.138) turns out to be the 
solution of a system of an infinite number of homogeneous linear equations. 
Hill extended to this system the rules which are used for the solution of finite 
homogeneous linear systems and introduced (a bold idea in those days) an 
infinite determinant, that is, the determinant of a matrix with an infinite 
number of rows and columns. 



(4.138) 



addition to the already quoted papers, see G. W. Hill: On the part of the motion 
of the lunar perigee which is a function of the mean motions of the Sun and Moon, 
Acta Math. 8, 1-36 (1886); E.W. Brown: An Introductory Treatise on the Lunar 
Theory (Cambridge University Press, 1896). 




Chapter 5 

Orbits in Given Potentials 



The A/'-body problem presents unsurmount able difficulties even for N greater 
than three. It is clear, therefore, that, when N is big enough, the problem is 
definitely unmanageable, if considered from the point of view of the study of 
a dynamical system with 3N degrees of freedom. A possible way out of this 
impasse is to introduce the mean field potential generated by many bodies 
and to study the motion of a single particle, with the reasonable hypothesis 
that it does not appreciably perturb the external field. Further simplifying 
assumptions, like that of stationary equilibrium in the average distribution 
of the members of the system and its structural symmetry properties, cir- 
cumscribe a dynamical problem with three degrees of freedom that presents 
many interesting aspects, especially with regard to its applications in galactic 
dynamics. 
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5.1 Introduction 

In this chapter we shall study the orbits followed by a point particle subject to 
a force field arising from a given external potential. Unless otherwise specified, 
we shall assume that the motion of the particle occurs in Euclidean three- 
dimensional space, in which a potential function^ ^(r), r G is defined. 
The problem of the motion in a given potential (with two or three degrees 
of freedom), as we we have already remarked many times, is of fundamen- 
tal importance in analytical mechanics since, with regard to symmetries or 
other properties exhibited by ^(r), one tries to establish whether the asso- 
ciated dynamical system belongs to the class of integrable systems. In the 
treatment that follows, we also wish to lay emphasis on the astrophysical 
applications of this problem: if ^(r) is the gravitational potential generated 
by a suitable mass distribution, the trajectories of the motions associated 
with it are a fairly good approximation of the orbits covered by the stars in 
a galaxy (and also, with almost the same accuracy, by a star in a globular 
cluster or by a galaxy in a large galactic cluster). The justification for this 
important property lies in the fact that the potential determining the orbit 
of a typical star in a galaxy is the mean field generated by the overall mass 
distribution, whereas the presence of the single stars themselves manifests 
itself as a slight perturbation, the effect of which becomes important only 
on a time-scale much longer than the average orbital period. The rigorous 
estimate of these time-scales is obviously impossible in general for systems 
composed of a large number of bodies {N ~ 10^ — 10^ in globular clusters 
and N ^ 10^^ — 10^^ in ordinary galaxies). An approximate method of eval- 
uation is based on the analogy between the self-gravitating A'-body system 
and a particle gas:^ the trajectory of a given test particle is changed by the 
deflections and the accelerations deriving from subsequent interactions with 
neighbouring particles. We can therefore introduce a time-scale after which 
the test particle has “forgotten” its initial motion, defining it as the time 
such that the quadratic variations in the velocity, due to the perturbations, 
sum up to reach the order of magnitude of the square of the initial velocity. 
In this very simplified picture, a parallel “braking” effect (the so-called dy- 
namical friction) must be combined to the above random acceleration effect, 
so that the whole matter can be seen as the application of Langevin’s theory 
of Brownian motion to the A-body gravitational problem.^ In the case of 
binary interactions this time-scale is 

^ In this chapter we adopt the customary notation of works on stellar and galactic 
dynamics, so that we introduce the potential ^ = —U and use the symbols E for 
the total energy and L for the angular momentum instead of h and c. 

^ S. Chandrasekhar, J. Von Neumann: The statistics of the gravitational field arising 
from a random distribution of stars, I and II Astrophys. J. 95, 489-531 (1941); 
97, 1-27 (1943). 

^ See, e.g., W. C. Saslaw: Gravitational Physics of Stellar and Galactic Systems 
(Cambridge University Press, Cambridge, 1985), pp. 3 ff. 
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Th 



N 



(5.1) 



where ra is the dynamical time of the system, which means the average cross- 
ing time of the system for some of its typical members, or, more rigorously, 
the quantity (having the dimensions of a time) ^jDja^ where D character- 
izes the typical extension of the system (and can be identified, to orders of 
magnitude, with the f appearing in the virial relation of (3.73)) and a is the 
average acceleration felt by the typical member of the system. It has been 
suggested by many authors that the above estimate might be modified to 
take into account the strong inhomogeneity of a self-gravitating system and 
therefore of possible collective effects. In any case, the various models devised 
always give relations between the time-scale and the dynamical time of the 
kind 

r ~ iV^Td, (5.2) 

where 1/3 < a < 1. Since 7*7 is a big number, we see that the claim made 
that the time necessary to modify the phase-space distribution is much longer 
than the dynamical time is well grounded and we can treat large 77-body 
systems in the framework of a collisionless dynamics. This means that the 
orbit covered by a typical star, for example, in an elliptical galaxy in a time of 
several billion years is, to a very good approximation, one of the trajectories 
admitted by the average potential of the galaxy. 

In the light of this, the potentials that will be examined here are of the 
same kind as those generated by the simplest density distributions g{r) used 
to create galactic models: distributions sharing the property of a monotonic 
decrease outward and characterized by 



a) spherical symmetry, {g = g{r), ~ x‘^ y‘^ + z‘^); 

b) spheroidal symmetry, {g = g{vu, z), -h y‘^); 

c) ellipsoidal symmetry. 



We already know (Sect. 2.1) that the motion in a spherical potential can be 
reduced to a problem with only one degree of freedom and is therefore inte- 
grable. It is of great importance to note that even the ellipsoidal potentials 
exploited to build galactic models in many respects behave like integrable 
systems, as has been possible to verify in several reliable “numerical exper- 
iments” . Therefore, the properties of the orbits in integrable spheroidal and 
ellipsoidal potentials are useful to illustrate some of the aspects of the struc- 
ture of real stellar systems. 

The possibility of showing analytically that a system is integrable is 
granted by the knowledge of a number of integrals of the motion, independent 
and in involution, equal to the number of the dimension of the configuration 
space of the system (Sect. 1.18). The explicit expression of the integrals of 
motion in terms of the coordinates of the phase space, as already remarked in 
Sect. 1.5, is possible only in a very limited number of cases and, in particular, 
when it is possible to associate the integral with a symmetry of the potential 
and with an invariance property of the coordinate system. 
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In the cases examined here an integral of the motion which is always 
present is the energy of the point particle: we are in fact interested in sta- 
tionary configurations, so that the Hamiltonian does not depend explicitly 
on time, and then 

h=H{p,q)=E 

is the corresponding integral of the motion. The existence of /i = E can 
be related to the symmetry of the homogeneity of time, which, expressed 
in the setting of Noether’s theorem, corresponds to the conservation of the 
integral (1.B.60), owing to the invariance of the Lagrangian under the time 
translation. On the other hand, we are clearly not interested in the case 
of space translation symmetries, since realistic galactic potentials are never 
translationally invariant (unless in some cases of local approximation). But 
there will surely be some cases of rotational symmetry: in the case of spherical 
symmetry there will be the three integrals linked to the three possible spatial 
rotations which are the three components of the angular momentum of the 
particle along the orbit: 

^ = ^(r) 4=^ I 2 = Lx ^ = Ly^ I 4 = Lz. 

The three integrals / 2 ,/ 3,/4 are obviously not in involution. We shall see 
in Sect. 5.2 that a combination of them, the absolute value of the angular 
momentum, 

\L\ = yjLj + Ly^+L,^ 

which is in involution with /i, together with I\ and one of the components, 
say Lz, is sufficient to describe the motion completely. 

In the case of spheroidal symmetry, the invariance with respect to the 
rotation around the symmetry axis (the 2 : axis, by definition) determines the 
conservation of the component of the angular momentum around this axis, 
which is therefore an integral of the motion: 

^ — 4=4 l 2 =Lz. 

As we have already remarked, we know of the possible existence of an in 
case (b) above and of an I 2 and an Is in case (c) which, in general, are not as- 
sociated with any symmetry of the kind envisaged above. Their explicit form, 
however, can be found only in some particular situations and by exploiting 
dedicated techniques that will be illustrated in what follows. 

From the preliminary discussion of the Sect. 1.1 we know that the maxi- 
mum number of independent conserved quantities in an autonomous system 
with n degrees of freedom is 2n — 1. It is therefore necessary to establish a 
relation among the functions 

Fk = Fk{p,q), fc=l,...,2n-l (5.3) 

and the integrals we spoke about in the case of integrability: the n indepen- 
dent integrals in involution 7^, a = 1, . . . , n, for the nature of the motion in 
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phase space, have the property of confining, or “isolating” , the trajectory to 
the invariant tori. Following Wintner’s denomination, in galactic dynamics 
they are named isolating integrals. It is important to note, however, that the 
number of isolating integrals can be even greater than n because they may 
not be in involution. For example, because of the commutation relations 

[Lx’S Ly] = Z/2;, [Ly^ Lz\ — Lxt \Lz<) Lx] = Ly^ 

the three components of L are not in involution (the analysis of the motion 
in the spherical potential in Sect. 5.2 will shed further light on this aspect of 
the behaviour of an integrable system). 

It is clear in addition that, in general, for isolating integrals no prescription 
exists that allows the determination of the form of their dependence on the 
phase-space coordinates. A simple but important case in which it is possible 
to find the explicit form of one or more isolating integrals is that in which the 
system is subject to resonances. Referring to the discussion of Sect. 1.3, we 
note that an integrable system is in resonance if two, or more, of its frequen- 
cies are commensurable. Given two integers m^, mt> a,5=l,... ,n, two 
frequencies are commensurable if 

UJarria - (jObTUb = 0, (5.4) 

and we say that the resonance is of order mblrUa- The frequencies are func- 
tions of the actions, so that, if we vary the J, the invariant torus changes 
and, with it, the corresponding frequencies For suitable values of J 

condition (5.4) will be satisfied and the corresponding torus will be called 
“rational” , or “resonant” : in particular, if n — 1 independent relations of type 
(5.4) are satisfied, the trajectory described by a solution of the form (1.C.149) 
after mi periods Ti — 2 tt / uji^ ... , periods returns to the 

same values of ^(0). It will therefore be closed, instead of covering densely the 
torus, and the trajectory will be “confined” to a manifold with a dimension 
less by one than in the general case, so that even in this case we can speak of 
other isolating integrals. The most interesting circumstance is that in which 
one, or more, relations of kind (5.4) are always satisfied (independently of 
the values of J). For every resonance so established, the identically vanishing 
function 

4+1 = 6ama - Obmb (5.5) 

will be an integral in involution with the already known In. Standard exam- 
ples of this situation are that of the Keplerian potential and the harmonic 
potential, as particular cases of spherical potentials (Sect. 5.2): in the first 
case the radial frequency uji = Ur and the azimuthal one uj 2 = coin- 
cide (order 1 resonance) and in the second case we have uoi — 2 uj 2 (order 2 
resonance) . 

From the previous discussion we see that, in general, the number of iso- 
lating integrals will be less than (and, only in some particular cases, equal to) 
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the number of integrals of the motion, which, for (5.3), is always k — 2n — 1. 
The fundamental property that distinguishes, from the mathematical point 
of view, the isolating integrals /^(p, g), a = 1, . . . ,nj from the other, non- 
isolating, integrals 

is that the /^(p, g) are one- valued functions of the phase-space coordinates, 
whereas the Fa are many-valued functions. To see this, assume that we know 
nj isolating integrals /q. The level hypersurfaces of these ni functions will 
define a submanifold of dimension 2n — nj (see Sect. 1.2) 

Mi = {{p,q)\Iaip,q)=Ca}. (5.6) 

Varying the parameters Ca we get an n/-dimensional family of hypersurfaces 
Mi, which makes up a foliation of the phase space, since these submanifolds 
cannot intersect themselves. In fact, if a trajectory of the dynamical system 
has a point on Mi — Ca, it lies entirely on Mi, because from the definition 
of integrals of motion, and the theorem of uniqueness of the solutions of 
differential equations, two of these hypersurfaces cannot have any point in 
common. Suppose we are given another integral I'{p,q); consider a point 
(Po, ^o) ^ trajectory passing through it. /'(p, q) is isolating, with 

respect to this trajectory, if it is possible to find a subset of Mi of dimension 
2n — ni (the same as that of Mi) in which no point is arbitrarily close to 
the hypersurface: 

I'(p,q) = I'iPo,Qo)- 

Otherwise, /' is non-isolating. If therefore there are no more isolating inte- 
grals, the trajectory will densely cover the hypersurface Mi: given a neigh- 
bourhood D of (Po^^o)^ infinitely many times in every subset of 

D (see the recurrence theorem of Poincare in Sect. 1.10). The Fa then estab- 
lish some links among possible coordinates on Mi, where = A^(p, q): 
making one of these (e.g. Ai) explicit with respect to the others, we find that 
all the values of the other A^ must correspond to one value of Ai. Therefore 
FA{p^q) are many- valued functions. 



5.2 Orbits in Spherically Symmetric Potentials 

In the previous section we noted that the potentials of interest in galactic 
dynamics are assumed to be integrable, and therefore the most powerful 
technique for studying these systems is that based on the use of the action- 
angle variables (see Sect. 1.16). Let us then study the simplest case, that of 

^ This definition of the isolating integral is due to J. Binney, S. Tremaine: Galactic 
Dynamics (Princeton University Press, 1987), p. 113. 
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the spherically symmetric potential, in the framework of the Hamilton- Jacobi 
theory. This approach offers the following advantages: 

a) many results that are true in the spherical case are, in the action-angle 
formalism, qualitatively true also in other cases, and therefore they con- 
stitute a prototypical situation to which we will often refer; 

b) in the particular case of Kepler’s problem (Chap. 2), one obtains the De- 
launay elements directly; 

c) the action-angle settings are the best ones for constructing the perturba- 
tion theory (as will be seen in Volume 2). 

We shall therefore undertake the treatment of the motion in a central force 
field, closely following the exposition of the theory we gave in Sects. 1.16 and 
1.17. This subject is of such great relevance for its physical applications that 
the number of treatments available in the literature is enormous. We think 
that one of the best expositions of all is still the beautiful book by Max Born,^ 
whereas for applications in stellar dynamics the best up to date reference is 
the already cited book by Binney and Tremaine.® 

We are given the Hamiltonian of a particle of unit mass in a gravitational 
potential that is invariant with respect to all rotations, 

= ^{pI + pI + pI) + (5-7) 

where z are Cartesian coordinates, px — Py = Pz = ^ their con- 
jugate momenta and ^ a monotonically increasing function tending to zero 
if its argument goes to infinity. With a generating function of the third type 
(see (1.C.72)), 



W3{p,Q) = W3{p^,Py,p^,r,'&,ip) 

= — {pxV sin i9 cos p + pyX sin 'd sin (p Pz^ cos 'd ) , 

we perform a canonical transformation to spherical coordinates 



2: 






arccos ■ 



p — arcsm 



y 



\/x^ + \ 



(5.8) 



whose conjugate momenta are 



Pr = r. 


(5.9a) 


Pd = 


(5.9b) 


p^ = sin^ dp. 


(5.9c) 



^ M. Born: op. cit., in Sect. 1.17. 

® J. Binney, S. Tremaine: op. cit., pp. 103 ff. 
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The Hamiltonian in the new coordinates is 



^ J fpr + + 2 2 y j) + (5-10) 

^ \ H sin 17 ^ J 

According to the theory of Sect. 1.16, to the Hamiltonian (5.10) there corre- 
spond the Hamilton- Jacobi equation 



dr ) V 



+ 2[^(r)-/i]=0, 



\drj ' r^\dd) ' r^sin^i^V^^y ^ ^ ^ ’ 

where the function S{Q^J) again generates a canonical transformation 
{Q^P) = (5 

by means of the partial derivatives 



:S{Q.J). 



e=^s(Q.j). 



To frame the problem in the best setting we will solve partial differential 
equation (5.11), obtaining, at the small price of a some heavy mathematical 
work, the most enlightening expressions for the solution of the problem, since 
they will be expressed in terms of the variables that offer the most direct 
physical interpretation. We know, from the general discussion of Sect. 1.15 
and the introductory treatment on isolating integrals in the previous section, 
that (5.11) is a candidate for a solution by separation of the variables. Let 
us therefore try to determine a complete integral of it, assuming a solution of 
the form 

5(r,^,^) = 5i(r) + 52(i?) + 53M. 

Putting this expression of S into (5.11), we can break it down into three 
ordinary differential equations: 



+ ^ + 2 [^(r) - h] = 0, 



^3 _ t2 

. 2 Q ~ ^25 

sm V 



which can be solved for the derivatives of S (according to (5.13)): 



Pr = a/2[/i -^(r)] - J, 



(5.14a) 



dSs _ _r 

dip 



(5.14b) 



(5.14c) 
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Of the three integration constants, we already know that I\ is the integral 
of the motion corresponding to the energy: Ii = E. To interpret the other 
two, we note that, in the spherical coordinate system, the angular momentum 
around the polar {i^ — 0) axis is 

Lz = r^(/9sin^ 'd = (5.15) 

and the absolute value of the total angular momentum is 

\L\ = ^ Ll-\- Ll = sin^ ^ sin“^ ??. (5.16) 

Equations (5.14c) and (5.15) therefore say that /s = Lz is the constant 
polar momentum. Equations (5.14b) and (5.16), however, say instead that 
I 2 = |T| is the constant absolute value of the total angular momentum (which, 
as is true in every central field, is constant also in direction). The orbit in 
space is always contained in a plane that has an inclination angle i with 
respect to the plane P = ttI 2 given by 



h 

i = arccos — . 

h 



(5.17) 



The line of intersection between the orbital plane and the plane = tt/2 
is called the line of the nodes. From (5.14) it is also possible to deduce the 
qualitative aspect of the orbit in the orbital plane: the p coordinate is cyclic 
and is therefore related to the rotation around the 2 : axis. The d coordinate 
is instead associated with a libration, symmetrical with respect to the plane 
'd = 7 t/2, whose limits are given by the zeroes of the argument in the square 
root in (5.14b), that is, by 

sind = ^ = cosi, i<d<^-\-i. (5.18) 

±2 2 2 

The radial character of the motion depends on the behaviour of the argument 
in the square root in (5.14a): if ^(r) is a monotonically increasing function 
with r and such that 

dL> I 

|^(0)| = l^ol < 00 , — = 0, lim ^ = 0, 

dr 10 r^oo 

the argument of the square root in (5.14a) will be positive for rmin < r^ax 
(where rmin and rmax are the only zeroes of the function inside the square 
root) and negative outside this ring, to which the orbit is therefore confined. 

As can be easily verified, EJ, |T| and Lz are isolating integrals of the 
motion in involution, so that the motion of the system is quasi-periodic 
(Sect. 1.17). To find a complete characterization of the motion, even to de- 
termine its progression in time, and to find the elements of the orbit, it is, 
however, better to pass to the action-angle variables (J,0). Starting from 
the canonical variables (P, Q), the action integrals, in virtue of (5.14), are 
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J.= 


1 

27T j 


fp^dr=^j 




(5.19a) 


Jt^ 


1 

27t j 




^ \ II - 

y sm d 


(5.19b) 


J (p — 


1 

27t j 




f hdip. 


(5.19c) 



where each integral is intended to be performed on a cycle (of libration in r 
and 1 ?,^ of rotation in (p). The third integral immediately gives 

J(^ — Is = Lz’ (5.20a) 



By means of the substitution cos = x sin i the second integral can also be 
performed directly. In fact 



= 



_ a/2 f V 

T T 



n/T 



— ax^ 



-dx, 



where we have defined 



a = 



II 



< 1 . 



For the indefinite integral we have 



/ 



1 f X 

„-dx = - arctan , 

1-ax^ a VvT^ 



— a 



arctan 



y/1 — a 



\/l — J 



Being a libration, the integral must be multiplied by 2 and evaluated between 
+1 and —1, which are the integration limits corresponding to 



2 



The result is simply 

J^ = l2-h = \L\-Lz. (5.20b) 

If we know the explicit form of ^(r), it is then possible to express the en- 
ergy in terms of J. The two results embodied in (5.20a) and (5.20b), inserted 
into the argument of the integral in (5.19a), enable us to reach the following 
important conclusion: the energy depends on Jr and on the sum Jr& -\- J^p. 
Therefore, the two frequencies 

dn _ dn 

‘^'d — o T ’ 07 

uJ'^ O J ip 

have the same value and the system is said to be simply degenerate, having 
a first order resonance (see Sects. 1.17 and 5.1). The degeneration can be 
removed by means of the simple canonical transformation 




9\ = Or 
O 2 — 0^ — Or 
O3 = Ocp - 0^ 
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J\ — Jr “h J'd “h J(^ — (jJr (5.21a) 

1/2 — J'O J ip ^2 — ^'0 LOr (5.21b) 

J3 = Jp cc;3 = Cc;<^ - = 0 (5.21c) 

so that the energy is a function of the form 



n = n{JuJ2). 

From (5.20a) and (5.20b), we now see that 

J2 = \L\; J3 = L,. (5.20c) 



The angular variables Oi^ i = 1,2,3, rather than Or.O^^O^^ are the variables 
that lead to the simplest representation of the motion on the orbit and that 
give physical meaning to some of the classical orbital elements. The [Oi^Ji] 
are canonical coordinates, so that the transformation relations linking them 
to the (Qi^Pi) coordinates are still given in the form of (5.13). The angular 
variables are then 



0 = ± 

* dJi dJi 



J Pr(Jl,J 2 )dr + j Pi){J2,J3)d'd + Jsip 



(5.22) 



which, expressed in terms of the isolating integrals and of the coordinates 
(r, I?, (f), contain all the information on the orbits. Equation (5.22), for i = 1, 
gives 



0i = 



d [ , T T fdprdU f f dr 

— J pAA,h)dr = y __* = a,, y — * = 0,1 y 



(5.23) 



where we have exploited (5.21a), by writing d'KjdJ\ — o;i, and where the last 
term of the equality is obtained by noting that depends on the Hamiltonian 
through the argument of the square root appearing, for example, in (5.14a). 
On the other hand, since the expression of the motion in terms of the angular 
variables is trivially given by 



=o;i(^-to), (5.24) 

the explicit integration of (5.23), when possible, gives the link between t and 
r, that is the equation for the radial coordinate of the orbital motion: 

dv dv 

— = dt^ -=t-to. (5.25o) 

Pr Jr{to) Pr 

To a cycle in which the angle 0\ increases by 27 t there corresponds a whole 
libration cycle of the radial variable, so that 
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is the radial period of the motion. If we generalize the definition which belongs 
to the Keplerian case, 0i coincides with the mean anomaly (Sect. 2.4). 
Equation (5.22), for i = 2, gives 



02 — 



dJ2 



j Pr{Ji,J 2 )dr + j p^{J 2 ,Jz)d'd 



= J (^^ + ^']dr+ f ^ 



\dndJ 2 ^ dJ2 



/ 



dJ2 






which, using forms (5.14a) and (5.14b) of the momenta in terms of the actions 
(keeping in mind the links established by (5.20) and (5.21) between actions 
and integrals of the motion) and dl-L / 9 J 2 = 0^2 , becomes 



02 



J \ J Pr J Pi) 



(5.26) 



Proceeding analogously, (5.22), for i = 3, gives 



0z = <p- 



J 



dm 

dJz 



(I'd = if 



J 2 



/- 



dd 



sin^ d 



J2 

1 13 

sin^ 



(5.27) 



With a procedure similar to that for calculating (5.19b), we find that the two 
integrals in d in (5.26) and (5.27) are respectively 



J2 



/-= / — 
J P^ J ^ 



dd 



l>-l 



COS^ I 

sin^ 'd 

:os idl'd 



cosd 



sin^ dJl - 

Y sm^ v 



= arcsm ( ^ — r I + const., (5.28) 

= arcsin (cot i cot d) + const. (5.29) 



Finally, to evaluate the integral in dr in (5.26), we can use equality (5.25a) 
written in the form 

UJ2 / — =o;2(^-^o)- (5.30) 

Jr{to) Pr 

Let us now introduce the azimuthal angle ^ representing the angular 
coordinate of the point particle in the orbital plane, measured counterclock- 
wise from the ascending node (see Fig. 5.1). From the definition of angular 
momentum and from (5.20c) and (5.25a) we have 

iri 7 2# 2 d'lp 



or, in integrated form. 



'tp-i’ih) = \L\ j 

J r\ 



dr 



r(to) r^Pr 



(5.31) 



If even in this case we generalize the definition of the Keplerian case, the angle 
'0— V^(to) turns out to be the true anomaly. With the symbols of Fig. 5.1, using 
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z 




Fig. 5.1 



relation (A. 3) of spherical trigonometry (see the Mathematical Appendix) 
applied to the spherical triangle NPK^ we can write the relation 



sin '0 sin i — sin PKN sin 




Since PKN is a right angle, this implies that the azimuthal angle can be 
written in the form 

/ cos'd \ 

lb = arcsin ( ) . (5.32) 

\ sm z ) 

If Q is the longitude of the ascending node, observing that 

/TOP = - - 1? 

2 

and using (A. 4) applied to the same triangle, we obtain the formula 

sin NOK = sm{(p — f]) = cot i cot 'd. (5.33) 

We are now in a position to make a physical interpretation of the angle 
variables 6i. Equations (5.33) and (5.29) put into (5.27) give 

Os = (p — arcsin [sin((^ — j?)] = i7, (5.34) 

so that 9s coincides with the longitude of the ascending node, and is constant, 
owing to the fact that lus is zero, recalling (5.21c). Subsequently, substituting 
relations (5.28), (5.30), (5.31) and (5.32) into expression (5.26) simply gives 

02=uj2{t-to), (5.35) 

whose interpretation, in spite of its apparently trivial form, is given by the 
following line of reasoning: recalling (5.20c) and (5.31), we can also write 
(5.26) in the form 
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O2 = - i>o + 



f 



(j 02 — 



J 2 \ dr 



= V’ - V'o + 



d 

dJ2 



/ Prdr, 

J ro 



where '0o = '0(^(^o)) is the value of ^ at a fixed ro = r(to). Now, the inte- 
gral / Prdr is a many- valued function of r that, in particular for a complete 
libration of the radial coordinate, increases exactly by Ji, and its derivative 
with respect to J 2 again takes its previous value. We see then that 62 is the 
angular distance (from the line of the nodes, for example) of the position 
of a point on the orbit with the radial coordinate fixed at the value tq. If, 
in particular, we choose ro = rmin (that is, the point corresponding to the 
pericentre), we can give an interpretation of relation (5.35) by saying that O 2 
represents the angular advance of the pericentre and that uj 2 is the precession 
velocity of the pericentre. In a libration in r, which by (5.25b), happens in a 
radial period = Ti , the pericentre advances by an azimuthal angle 

SO 2 = UJ 2 T 1 = 27 t — . ( 5 . 36 ) 

LUi 



Recalling, from definition (5.21b), the relation among the azimuthal frequen- 
cies, we can write 



27t / A'ljj ^ 

UJ2 = - UJi = — LJi ^ 1 ) Cc;i, 

\ Z7T 

where we have introduced the azimuthal period 



27T 



27T 27T 



Tj, = —rrTr = = 



Ai/j 






(5.37) 



(5.38) 



defined as the time necessary for '0 to increase by 27 t, and the change in the 
azimuthal angle corresponding to a radial period 









dr 

r‘^Pr 



(5.39) 



In the case for which A'lp is a rational submultiple of 27 t, 

Z\0 = n, fc G iV, 

k 

the orbit is closed and therefore penodic. We then have a resonance that, 
expressed in the new frequencies, is given by the relation 



miuji — m 2 UJ 2 = 0, mi = k — m 2 = k. 



After mi cycles in r and m 2 turns of the pericentre, the orbit closes in on 
itself. 

In the light of the results obtained, the general properties of the orbits 
in a spherically symmetric potential can be summarized by saying that the 
generic orbit can be represented as an annulus (densely filled up after many 
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periods) limited by r = rmin and r = rmax- To specify the orbit completely 
we need to know four parameters (the elements of the orbit ) : 

1) the inclination i; 

2) the longitude of the node i?; 

3) the pericentre distance rmin; 

4) the apocentre distance rmax- 

The first two elements (i and i?) are needed to fix the orientation of the plane 
of the orbit in space; the third and fourth (rmin and rmax) its extension. 
To these four elements there correspond just as many isolating integrals: 
treatment by means of the action-angle variables gives them explicitly as an 
independent set formed by the three action integrals J and by the constant 
angle ^3 = i7. If from these one wants to come back to the standard integrals 
mentioned from the outset in the previous section (case (a), la — {E,L))^ it 
is sufficient to use relations (5.20), the fact that tani? = LyjLx and equation 
(5.19a) to find the function E = E{J) explicitly. 

If, as we have said above, there is a resonance of order mi/m2 between 
uji and uj 2 ^ then, in agreement with (5.5), we have a fifth isolating integral: 



h = rUiOi - 171202, 



which, for (5.23) and (5.26), can be written as 



h = miUJi 






J 2 

^2 y 



dr 

Pr 




M 

P'S ’ 



and which, using the resonance condition mitoi — 102002 — 0, has the form 



birr'd) = m2 




— arcsin 



cos?? 
sin 2 



(5.40) 



E is isolating because, if we explicitly write out the function r = r(??), of 
which E = const, is the implicit form, for a given value of d there corresponds 
an integer number m 2 of isolated values of r, and so E is locally a one- valued 
function. If there is no resonance, then, as discussd in the previous section, 
a fifth integral of the motion exists, in this case given by 



ipo{r,'&) 



= arcsin 




dr 

9 ’ 
r^Pr 



but it is not isolating. 
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5.3 Orbits in Isochronal Potentials 



Let us apply the theory of the orbits in spherically symmetric potentials from 
the previous section to the very interesting case of the family of isochronal 
potentials. This class of models exhibits many attractive features in stellar dy- 
namics and has therefore been extensively studied since its introduction^ and 
also applied as an unperturbed model to study the orbital structure of more 
complicated systems by means of perturbative^ or other semi-analytical® 
techniques. The class of isochronal potentials is given by the function 



= - 



GM 

b + \/P~+r^ 



(5.41) 



where the positive constant GM, the product of Newton’s gravitational con- 
stant and a characteristic mass, must be identifyed with the constant /i used 
in Chap. 2 (see equation (2.16) and ff.). The function (5.41) is parametrized 
by the characteristic length b which gives the extent of the region where the 
potential resembles that of a homogeneous body, whereas, for r 5, it gives 
the typical behaviour of the potential far away from a finite body (of mass 
M), ^ oc r“^. As a particular case of fundamental importance, function (5.41) 
comprises the Keplerian potential, when 5 = 0: 

5 = 0 — ^i{r) = ^k(^) == • 



The aim of this section is to find the explicit form of the action-angle 
variables in a concrete case, along the lines proposed in the previous section. 
We shall also find, therefore, the classical Delaunay elements of the orbits of 
the two body problem, introduced in Sect. 2.5, since, in what follows, all the 
formulas of the Keplerian limit can be obtained by putting 5 equal to zero. 

The isochronal potentials are so called because they are characterized by 
a very simple relation between the radial period and the energy: 



_ 2ttGM 



(5.42a) 



independent of 5 and the other integrals of the motion. Let us start then by 
giving the proof of this relation. The radial period, from (5.25), is given by 
the integral 



Tr = 2 





rdr 

V^2r2 [E-^j(r)] -L2’ 



(5.43) 



^ M.Henon: L’amas isochrone I, Annales d’Astrophys. 22, 126-139 (1959); L’amas 
isochrone II, ibid. 491-498. 

^ O. E. Gerhard, P. Saha: Recovering galactic orbits by perturbation theory, Mon. 

Not. Roy. Astron. Soc. 251, 449-467, (1991). 

® C. McGill, J. Binney: Torus construction in general gravitational potentials, Mon. 
Not. Roy. Astron. Soc. 244, 634-645, (1990). 
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where ri and r 2 are the roots of the argument of the square root (from 
what we said above, there are two real roots). Let us introduce the auxiliary 
variable 

GM 

such that, solving for r, we have 



1 + 



rp2 



(5.44) 




s > 2, 



and 

rdr — b^{s— l)ds. 

In terms of s (5.43) becomes 



26 n {s - l)ds 

where 

-’^(Sl,S2) = (S2 - s)(s - Si), 



(5.45) 



and we have introduced the new limits between which the argument of the 
square root is positive and that are defined by 



^2 + Si — 2 



S2 - Si = 2 



S2S1 



GM 

gmV l2 

2Eb^ '' 

+ 4GMb 



2Eb‘^ 



We also define the useful quantities 

GM 



a = — 



2E 



- 6 ; 



e = \ 1 + 



L2 

2Ea2’ 



(5.46a) 

(5.46b) 

(5.46c) 



(5.47) 



for which, with hindsight, we will soon find an interpretation as orbital pa- 
rameters and that allow us to write the relations in (5.46) as 

S 2 = 2 -h —(1 -be). Si = 2 -f- -(1 — e), (5.48) 

b b 

5(s 2 -h si) = 2a + 45, 5(s2 — si) = 2ae, (5.49) 

5^S2Si = 46(a + 5) + a^(l - e^). (5.50) 

Exploiting the above relations (5.48-50) to write all expressions where 
S 2 and Si appear in terms of a, 5 and e, the indefinite integral appearing in 
(5.45) amounts to: 




318 5. Orbits in Given Potentials 



r \ CL + b . f b{s — 2) — a\ ^ 
r[s) — — - — arcsin I ] — VX, 



ae 



(5.51) 



SO that the radial period becomes 
2b 



T — 

±r — 






27t 2-kGM 

(t(s 2 ) - r(si)) = -j=={a + b) = 






i-2E) 



3/2 ' 



and so equality (5.42a) is proved. The function t{s) defined by (5.51) allows 
us also to express the angular variable 0i in terms of the radial coordinate. 
In fact, recalling equation (5.23), we have 

f dr 27 t b 
^ 



Pr 



Tr 



It is convenient at this point to introduce the variable u (the eccentric 
anomaly), by means of 



bs = 2b a{l — ecosu), 



(5.52) 



so that the angle 6i can be written in the form 

6i — [(a + b)u — aesinu] — u sinu. (5.53) 

y —2E a -\- b 

The true anomaly f{u) = 'ip{u) — iIjq, defined such that corresponds to the 
pericentre, is obtained by integrating (5.31). Hence 



/ 



dr 

r‘^Pr 



1 

1 

V^^Eb 

1 



/ 

1 

2 



2L 



arcsin 



(5 — l)ds 
(s - 2)sVX 



i 



ds 

(s^^2)7v 



e — cos u 
1 — c cos u 



f ds 

J s^/X. 



2^/T?TAGMb 



: arcsin 



ac — (2b + a) cosu 
2b + a(l — ecosu) 



By using the trigonometric relations 



arcsin (3 



2 arctan 

2 



1-/3 

1 + / 3 ’ 




1 — cos u 
1 -h cos u ’ 



we can write the true anomaly as a function of the eccentric anomaly as 
follows 
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f{u) = ip{u) - V'o 



'/ 



dr 

r'^Pr 



= arctan 



1 + e u 

tan — 

1 - e 2 






\L\ 

+ AGMb 



arctan 



a(l + e) + 25 u\ 

»(l-e) + 2fc *“2j- 

(5.54) 



Recalling equation (5.39), we can evaluate by means of 

/ dr 

so that, from the definition given in (5.37), the commensurability relation 
between the azimuthal and radial frequencies is 



^ _ 1 / 

uJi ~ 2\^L^^4GMb 



so that, taking into account that, from the isochronicity relation (5.42a) and 
the first of definitions (5.47), the radial frequency can be written as 



(._ 2 £;) 3/2 ^ 

GM ~ {a + 6)3/2 ’ 



(5.42b) 



and the explicit form of the azimuthal frequency as 



vgm ( \L\ 

^ 2(a + 5)3/2 \^i2 + 4QMb )' 

Exploiting the function r(s) of (5.51), the time expressed in terms of the 
eccentric anomaly is 



t-to 



r b r {s - l)ds 

Pr Vx 

(r(s) - r(smin)) = ~ aesinw], 



(5.56) 



whereas the equation of the orbit in the form r — r{u) is, for (5.44) and 
(5.52), 

r = ayj (1 — ecosu)(l — ecosu + 2b/ a). (5.57) 

It is useful to have the expression of the momenta in terms of the coordi- 
nates. By means of (5.56) and (5.57), we have 



r 



dr 

dt 



dr du 
du dt 



dr / dt 
du \du 



GM aesinu 



Pi 



a + b r 



(5.58) 
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whereas for the result is readily obtained by recalling the general expres- 
sions (5.14b) and (5.32): 



Pi) = = \L\ (5.59) 

V sin d sin V 

As for the action integrals, for J 2 and J 3 the general results of the previous 
section give directly, from (5.20), 



J 2 ^ ^ = 1 ^ 1 , J 3 = = Lz = \L\ cosi = J 2 cosi. (5.60) 

To find Ji = Jr -h J 2 , we have to integrate the radial action integral (5.19a) 



-f 

^ Jvrr 



2[E-^i{r)]-^dr. 



This can be done directly, exploiting, for example, the techniques of integra- 
tion of complex analytic functions and the theorem of residues. The result 
is 



Jr = -^= - I {\L\ + VL^+^GMb) . (5.61) 



It is interesting and instructive, however, to obtain this result following an al- 
ternative route. From the fundamental property of the isochronal potential 
(5.42a), the radial frequency depends only on the energy, so that the partial 
derivative 



dn 

dJr 



dE 

~dTr 



— cji ( J/) — 



can be integrated in the form 






/: 



dE 

h{E) 



E f{L) — 



GM 



+ f{L)^ 



(5.62) 



where f{L) is a function to be determined. To this end, we exploit the prop- 
erties of circular orbits, that is, those orbits for which we have 

Vr = r = Pr = 0 Jr — = 0 



and 



2,2 2 



GMP 

v^P~T^(5 -h y/b‘^ + 



where v‘^ = Tc[d^ / dr)\r^ represents the equilibrium between the centrifugal 
and the gravitational force (on the circular orbit r = Vc). The energy and the 
angular momentum of a circular orbit are respectively 



Born: op. cit. in Sect. 1.17, Appendix on Elementary and Complex Integration, 
pp. 303 ff. 

^^J.Binney, S. Tremaine, op. cit., p. 186 
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= 



GM 

2\/b‘^ + ’ 



\L\^-) = r,VT = rc{E^^'^) - ^/) 



I GM 
-2bE^^) 



-2bEG) 

GM 



So that Jr vanishes, the condition 



/(L(<=)) = - 




must be satisfied for (5.62), where we have introduced the parameter 
Now, owing to (5.64), x satisfies the equation 



admitting solutions 



(1 -x)/vV, 



± v'(^W)2+4GM6 



As for the values admitted by x we must take the solution with the plus sign, 
so that the / function that we seek, which must be the same even if the orbit 
is not circular, has the form 



2GMb _ 1 

\L\ - VIXT^GMb ~ ~2 



(|L| + /l2T4GM5), 



which, put into (5.62), provides (5.61), as was to be proved. For (5.60) and 
(5.61) the integral Ji is then 



Ji = + I (l^l - VL^+4GMb) . (5.65) 

The inversion of this last relation provides the explicit form of the Hamil- 
tonian as a function of the action integrals 



n{JuJ2) 



2 [ Ji - i ( J 2 - yj|T4GM6) ] ' 



which, in one sense, is the main result of this treatment, since it gives a 
complete solution of the dynamical problem. 
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If we want to express the actions in terms of the model and orbital pa- 
rameters, recalling definitions (5.47), we obtain 



E = - 



GM 

2 (a + b) ’ 




GMg 2 
a b 



(l-e2), 



and inserting these into (5.65) yields 



(5.67) 



Jl - y/Gm^TV) 



1 + 



2(ft “h 6) 



v/r^ 



■ e"' + 



46 



1 + 



(5.68) 



whereas J 2 and J 3 can simply be obtained by inserting the second of equations 
(5.67) into (5.60). Prom (5.42b) and (5.55) the frequencies are given by 



y/GM 

~ (0 + 6)3/2’ 

from which we see that, in general, the two frequencies are not commensu- 
rable. According to the discussion of the previous section, the orbit densely 
fills the annulus comprised between rmin and Tmax* Their explicit expression 
as orbit elements can be obtained from (5.48), coming back to the original 
radial coordinate by inverting the transformation (5.44). The angle-variables 
are the mean anomaly ^i, given by (5.53), the angular advance of the peri- 
centre 



Co »2 



-(Jl 



(l + f)^-e2 



-1 



(5.69) 



(Jo 1 

O 2 = -6y = - 

(Jl 2 



(l + f)^-e2 



-1 



ae 

a b 



and the constant longitude of the ascending node Os given by (5.34). 

With these results we obtain a complete analytical description of the 
orbits admitted by the family of isochronal potentials. In the limit 6 = 0 (Ke- 
plerian potential) we again find the classical Delaunay elements of Sect. 2.5. 
The actions are 



Jl = y/GMa, J 2 = y/GMa{l - e"^), Jg = y/GMa{l-e^) cosi. (5.70) 
The frequencies are 



^1 = ^3/2 » o;2=a;3 = 0. 



(5.71) 



The angles are 



0i=LJi{t-to), 6>2 = V’o, 9^ = Q. 



(5.72) 



Taking account of the slight change of notation, we see that equations (5.70) 
and (5.72) coincide with (2.82) defining the Delaunay elements for the Ke- 
plerian motion and the first of equations (5.71) has the same meaning of 
(2.84b). 
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Putting b = 0 into Hamiltonian (5.66) gives 



niJi) = -- 



2Jf 



2[Jr + 



GM 

2a 






(5.73) 



which accounts for the complete degeneracy of Kepler’s problem, whereas 
b = 0 into (5.56) and (5.57) gives Kepler’s equation (2.53), 

, Vgm , 

= — TPT\^ -to) =u - esmu, 
a'^'^ 

and the equation of the orbit (2.50) 

r — a{l — ecosu). 



5.4 Elliptical Coordinates and Stackel’s Theorem 

In this section we introduce some of the curvilinear coordinate systems most 
useful for their application to analytical dynamics: in essence they are fam- 
ilies of elliptical (in two dimensions) and ellipsoidal (in three dimensions) 
coordinates and, owing to Stackel’s theorem, they play a very important role 
in the application of the Hamilton- Jacobi theory to the study of the motion 
in potentials endowed with spheroidal and ellipsoidal symmetries. In the case 
of motion in two dimensions we will also introduce conformal transforma- 
tions in the complex plane, discussing the link between elliptical coordinates 
and a conformal transformation that is particularly useful for applications 
in mechanics. For an exhaustive treatment we also give the relation between 
parabolic coordinates (as another class of curvilinear coordinates in the plane) 
and the Levi-Civita transformation, previously used for the regularization of 
the two-body problem (Sect. 2.6). A complete account of the application of 
ellipsoidal coordinates can be found in the classic work by Jacobi, whereas 
the general properties of curvilinear coordinate systems are reviewed in the 
textbooks by Whittaker and Watson, Morse and Feshbach,^^ and Jeffreys 
and Swirles.^^ As ellipsoidal coordinates have come back in vogue for their 
applications in galactic dynamics, there are very good expositions of their re- 
lation to the motion in galactic potentials by Lynden-Bell^^ and de Zeeuw.^^ 

G. J. Jacobi: op. cit., Lecture 26. 

T. Whittaker, G.N. Watson: A Course of Modern Analysis (Cambridge Uni- 
versity Press, 1927), pp. 547 ff. 

M. Morse, H. Feshbach: Methods of Theoretical Physics (McGraw Hill, New 
York, 1953), Chap. 5. 

^^H. Jeffreys, B. Swirles: Methods of Mathematical Physics (Cambridge University 
Press, 1956), pp. 157 ff., 532 ff. 

Lynden-Bell: Stellar Dynamics - Potentials with isolating integrals, Mon. Not. 
Roy. Astron. Soc. 124 , 95-123 (1962). 

^^P. T. de Zeeuw: Elliptical galaxies with separable potentials, Mon. Not. Roy. As- 
tron. Soc. 216 , 273-334 (1985). 
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Let X, y and z be the Cartesian coordinates in space. Consider the equation 
in the variable r 



f(j) = 



+ 



y 



+ 



r + ce ' T fJ r + 7 
where a, /? and 7 are constants such that 



1 = 0 , 



(5.74) 



a < /? < 7 < 0. 



For r = 0 (5.74) becomes the equation of an ellipsoid with semiaxes a = a, 
b = \/— /3, c = 7 referred to its own axes, whereas, if x, ^ and z are the 

coordinates of a point which is external to the ellipsoid, we have 



x^ y‘^z‘^ 

“2+t^ + ^“1>0. 



(5.75) 



If in (5.74), having fixed the coordinates x,i/andz of a point P, the variable 
r is considered to be the unknown, then we arrive at an equation of third 
degree in r: 



x^(r + /3)(r + 7 ) + y‘^{r + a){r + 7 ) + z^(r + a){r -f /?) 
-(r H- a){r + /?)(r + 7 ) == 0 . 



(5.76) 



All the roots of this equation are real. To find them we observe, first of all, 
that 



lim /(r) = - 1 . 



Then, letting r = —a + £ in (5.74), we find that the quantity 

f{—a + ^) = h — — 1 — 1 

£: p — a-\- e 7 — a + e 



is positive if e is sufficiently small. Therefore, equation (5.74) possesses a 
positive root lying between —a and -f- 00 , which we denote by A. 

If then, in the equation /(r) = 0, we alternatively put first r = — /? -h e 
and then r = —a — for e sufficiently small we have the inequalities 



f{~P + e)>0 and f{-a-e)<0, 

so that we can state that a second root, //, of /(r) =0 lies between —(3 and 
—a. If, finally, in (5.74), we put first r = —7 -h e and then r = —(3 — e, so 
that for s sufficiently small we have the inequalities 



/(-7 -\-e)>0 and f{-(3 - e) < 0 , 

we can conclude that a third root, z/, of /(r) = 0 lies between —7 and —(3. 
To summarize, the three roots of (5.74) satisfy the chain of inequalities 



—7 <v< —f3 < /i < —a < A. 



(5.77) 
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Given the Cartesian coordinates of a point, the three roots A, /i 

and u turn out to be uniquely determined: they can be interpreted as the 
ellipsoidal coordinates of the point. In turn, given the ellipsoidal coordinates, 
the inverse transformation exists, which allows us to find from the latter 
coordinates the corresponding Cartesian coordinates, but it turns out that 
this inverse transformation is not unique. Hereafter we establish the form of 
both these transformations and discuss briefiy the principal properties of the 
ellipsoidal coordinates. 

To obtain x^ysmdz^ as functions of A, y and u we observe that (5.76) 
can certainly be written as the product of binomials of the variable and the 
roots, so that 

-{t - \){t - fi){T - ly) =0. 

Equating the left-hand sides of this equation and of (5.76) and dividing by 
(r -h a)(r + P){r + 7 ), we find that 



/M = 



t + q; t + fd r + 7 



(t- A)(t-/x)(t-z/) 
(t + a)(r + 0){t + 7 ) 



(5.78) 



To find the coordinate, say, x, in terms of the ellipsoidal coordinates we 
multiply both sides of (5.78) by r + a and, after that, we impose the condition 
r = —a; proceeding in an analogous manner for the other coordinates we 
obtain 

2 (^ “I” + ^)(^ “1“ ^) 

^ (a-/?)(o;-7) 

= + + ( 5 . 79 ) 

2 ^ (A + 7)(At + 7)(t^ + 7) 

^ (7-q)(7-/3) 




Fig. 5.2a 



Fig. 5.2b 



Fig. 5.2c 
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To each set (A, /x, z/), lying in the intervals given by (5.77), there therefore 
correspond eight points, one for each octant. The surfaces A(x, y, z) = const, 
are ellipsoids (see Fig. 5.2a), since for inequalities (5.77) we have that 

ATq;> 0, A + /3>0, AH-y>0, 



and so, for r = A, the denominators of (5.74) are positive and the equation is 
therefore that of an ellipsoid: for large values of A this ellipsoid tends to the 
sphere of radius \/A + a, whereas as A tends to the limit — a the ellipsoid 
shrinks more rapidly in the direction of the x axis (its major axis always 
remains in the direction of the z axis) until at A = — a its minor axis becomes 
zero and the ellipsoid degenerates into the area delimited by the focal ellipse 
with the equation 



(/? - q) (7 - a) 



(5.80a) 



whose foci are at the points 2; = zb\/7 — j3 and which intersects the z axis in 
the points 2: = ib>/7 — a and the y axis at the points y = — a. 

The surfaces y{x,y,z) = const, are hyperboloids constituted of one sheet 
(see Fig. 5.2b) around the x axis, since inequalities (5.77) now give 



/xH-q< 0 , /iH-/3>0, /i + 7 ^ 0 * 



For fji — — a they degenerate into the surface given by the part of the plane 
X = 0 external to the focal ellipse defined above. With y tending to — /3 the 
ellipse by which the hyperboloid cuts the x = 0 plane shrinks and, in the 
limit jjL — — (3, it coincides with the segment on the 2: axis limited by the foci 
2: = ib^7 — (3. At the same time the hyperboloid degenerates into the part of 
the ^ = 0 plane lying between the two branches of the focal hyperbola given 
by 



x" 

(a- (3)^ p) 



(5.80b) 



This hyperbola has its foci at the points 2: = ±-^7 — a and cuts the z axis at 
the foci of the focal ellipse. 

The surface v — — (3 coincides with the portion of the y = 0 plane ex- 
ternal to the two branches of the focal hyperbola. The surfaces z/(x, y, z) == 
const., diminishing the value of are hyperboloids consisting of two sheets 
(see Fig. 5.2c), since inequalities (5.77) give in this case 



i/ + o;<0, z/ + /3<0, zx-h7>0. 



In the limit u = these hyperboloids coincide with the z = 0 plane. 

Through each point x, y and z pass one ellipsoid A, one hyperboloid of 
one sheet /x and one hyperboloid of two sheets v. Given the three values of 
X, y and z, the three corresponding values of A, /x and v are found by means 
of the relations 
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A -f /i + z/ = -a - /3 - 7 + + 2/^ + 2;^, 

Xfj. fj.iy + uX = aP + l3j 'ya - {(3 - (7 + - (o^ + P)z^ , 

Xfiu = —aP^ + P^x^ + + aPz^ , 



which are obtained by equating, in equality (5.78), the powers one, two and 
zero respectively of r. 

To determine the line element in ellipsoidal coordinates we can use the 
fact that the metric tensors of the two coordinate frames are linked by the 
relation 



dx^ dx^ 

9AB = 



45 = 1,2,3; i,j = 1,2,3; 



where = {A, /x, i^} and the matrix elements of the coordinate transforma- 
tion are defined as in (A. 7). We find then that, for e.g. A — I, 



9^^-Z^ydx) “ 4bA + a)2 ^ (A + /3)2 

^ dx^ dx^ 

^'12 = 2 ^-^-^= > 

^ dx'^ dx^ 



(A + 7)2 



= G 



2 



as can be verified by explicitly performing the derivatives of equalities (5.79); 
this allows us also to check the orthogonality of the coordinate system. To 
obtain expressions written only in terms of the r^, we can differentiate both 
sides of equation (5.78) with respect to r so that we have 



+ 






(r + (4)2 {r ^ py {t + j) 



{t- ii){r-v) 

(r -f- a)(r -h P)(r -h 7) 



dr (r + a)(r + /3)(r + 7) ' 



Putting r = A, this equality becomes 



y2 ^2 ^ (A-m)(A-4 

(A + 42 (A + /3)2 ^ (A + 7)2 (A + a)(A + /?)(A + 7)’ 



SO that in the end we find that 

r2 = p2 ^ 1 (A-/x)(A-4 

^ 4 (A + q:)(A + / 3 )(A + 7) 



(5.81a) 



Following the same route for the cases A = 2 and A = 3, we obtain respec- 
tively 




328 5. Orbits in Given Potentials 



G\ = Q'^ 
Gl = R^ 



1 A) 

4 (/x + a)(/i + /?)(/i + 7)’ 

4(i/ + a){i' + (3){v + 7) ’ 



(5.81b) 

(5.81c) 



SO that the line element, with general form (A. 6), in ellipsoidal coordinates 
becomes 

ds^ = P^dX^ + Q^dfi^ + R^diy^. 

It is clear that a three-dimensional coordinate system induces on given 
surfaces, in particular on planes, two-dimensional coordinate systems. The 
simplest and most important example we can describe is that of elliptical 
coordinates in the plane: putting 2: = 0 in (5.74) (and assuming, without loss 
of generality, that 7 also is zero) we can define the elliptical coordinates (A, 
/i) on the xy plane as the roots for r of the equation 



T P a 



+ 



r -h /? 



-1 = 0 . 



(5.82a) 



Without going through all the previous treatment of the three-dimensional 
case, to find all the relevant properties of the elliptical coordinates it suffices 
to put z = 0 (and 7 = 0) in all the expressions from (5.75) to (5.81). We 
easily find then that the generalization of inequalities (5.77) in this case is 



-/3 < M < -a < A, (5.82b) 

and that the relations between the Cartesian and the elliptical coordinates 
in the plane are 

2 _ (A -f a){ii -h ol) 

~ a-/3 

from which we can find 

X pL = —Oi — /? -h -h 
and, finally, that the line element is given by the expression 

ds^ = P‘^d\^ + 

where the components of the metric tensor are 

p2 — ^ A — /i 2 _ ^ /i — A 

4(A + a)(A + /3)’ ^ 4(/i + a)(Ai + ^)‘ 

The curves A = constant are ellipses with the major axis in the direction 
of the y axis and foci at the points y — ±\//3 — a. The degenerate ellipse 
A = — a is the segment of the y axis lying between the foci. The curves ji — 
constant are hyperbolas with the same foci. For ji — — j3 they coincide with 
the X axis; the value y — — a corresponds to the part of the y axis outside 



+ (5.83a) 



f3 — a 



Xy = af3 — j3x^ — ay^ 



(5.83b) 
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the foci. Near the origin the elliptical coordinates are approximatively similar 
to the Cartesian ones, whereas at large distances from the origin (A — )► oo) 
the ellipses become progressively almost circular and the hyperbolas tend to 
coincide with their asymptotes, which are straight lines passing through the 
origin. Therefore, at large distances from the origin, the elliptical coordinates 
tend to the polar ones. However, it is easy to verify that, in the particular 
case for which a = /3, the elliptical coordinates in the plane coincide exactly 
with the polar ones (r, ip). 

By using curvilinear coordinates in the plane we could very much enlarge 
the setting of the treatment, since their range of applications is very large and 
there are many links with other fields of analytical mechanics. The opportu- 
nity offered by the introduction of elliptical coordinates just made allows us, 
without too much digression, to briefly treat the coordinate systems gener- 
ated by conformal transformations in the complex plane and to shed light on 
their usefulness in applications. Further details can be found in the textbooks 
by Jeffreys and Swirles^^ and Markushevich.^^ 

Let us denote hy z = x iy the points on the complex plane and let 
w = F{z) be an analytic function. F{z) is therefore a single- valued function 
and its derivative F'{z) is continuous all over the plane: if we indicate with 
u{x,y) and v{x,y), respectively, the real and the imaginary parts of w, they 
must satisfy the Cauchy-Riemann relations (2.95), where the change of no- 
tation xi — > X, X 2 — y^ ui ^ u, U 2 V is assumed. The transformation of 
the complex plane generated by the function F{z) can be interpreted as a 
transformation of coordinates of the kind described in Appendix 2. Applying 
relations (2.95) we soon see that the Jacobian of the transformation is not 
zero and that, using the metric tensor transformation (A. 7), the line element 
in the two systems can be written 



dx^ + dy^ 




(du^ -h dv‘^) 



dx 

du 



du 



[du^ -h dv^) 



= \f'{w)\^ {du^ + dv^) , 



where we have introduced the inverse function 2 ; = /(rc), so that, clearly, we 
have 

/'(^) = F^y ( 5 - 85 ) 

Two line elements that are related to each other by a multiplicative factor 
are said to be conformal; the function F{z) therefore generates a conformal 
transformation. Without entering into the general exposition of the theory 



Jeffreys, B.Swirles: op. cit., pp. 419 ff. 

^®A. I. Markushevich: The Theory of Analytic Functions: a brief course (MIR, 
Moscow, 1983), pp. 60 ff. 
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of conformal transformations, in the following we describe two particularly 
useful examples. 

The first one represents a different way of introducing elliptical coordi- 
nates. Consider the analytical function 

z = f{w) — Z\sinhrc, (5.86) 

with A some real constant, which, exploiting the explicit expression for the 
hyperbolic function in terms of the complex variable 

sinh(rc) = sinh(n + iv) = sinh u cos v i cosh u sin v, 

provides the coordinate transformation 



X — sinh ix cost’, 
y = Z\ cosh sin r’. 



(5.87) 



with conformal factor 



\f'{w)\^ — A^\ cosh(ic)|^ = A^ (sinh^ u + cos^ ^’) . 
It is then easy to verify that the further transformation 



sinh^ u = 



A + a 
p-a 



sin^ V = 



M + /^ 

P — a 



A^ = P -a 



reduces expressions (5.87) to the standard form of equation (5.83a) of the 
relation between plane Cartesian and elliptical coordinates. It can also be 
easily verified that the properties of the conformal elliptical coordinates u 
and V are consistent with those examined above, noting in particular that 
the curves u = const, are the ellipses and the ones at v = const, are the 
hyperbolas. 

The second example of great relevance is that obtained by using the func- 
tion 2 : = f{w) — w‘^. In this case we have the coordinate transformation 



X = vP — ^ 

y = 2uv^ 



with conformal factor 

\f{w)f = 4|rcp = 4{u^ -h v‘^). 

This coordinate transformation was introduced in Sect. 2.6, where it was used 
to discuss the problem of the regularization of the two-body problem, and 
is known as the Levi-Civita transformation (2.94). It is easy to see that the 
curves with constant u and v are parabolas with the axis given by the x axis, 
focus at the origin and concavity respectively towards the left and the right. 
If afterwards we introduce the further transformation 

A = (lA + ^’)^, ii = (u — ^’)^, 
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we obtain the parabolic coordinates (A,/x), already introduced in Sect. 1.15 
for the study of the resonant anisotropic oscillator, as can be verified by 
observing that the relation between these coordinates and the Cartesian ones 
is 

X = 

1 . . A,/i>0, 

in agreement with (1.C.113). 

If two or three of the constants a, /3 or 7 in (5.74) are equal, the ellipsoidal 
coordinates assume particular forms in which these degenerations refiect some 
further symmetry. 

If /3 = 7 the coordinates (A, /a, v) become oblate spheroidal coordinates 
(A, /i, 7 ). The surfaces for which A = const, are oblate spheroids with the 
minor axis in the direction of the x axis. The surfaces at ja = const, are 
hyperboloids of revolution around the x axis. The interval in which u can 
range vanishes, so that it can no longer be used as a coordinate. The surfaces 
at u = const, in the general case (hyperboloids with two sheets around the z 
axis), are now the planes that contain the x axis and that are now labelled 
by a constant value of the azimuthal angle 7 . In every plane at 7 = const. 
{meridional plane) the coordinates (A,//) are elliptical coordinates, defined 
now as the roots for r of the equation 




+ 



r + /? 



1 = 0, 






(5.88) 



where —P</a<—a<\. The foci are at z = ±v^/5 — a and the relations 
between (A, /i) and (x,z) can be deduced from (5.82) and (5.83). In the 
equatorial plane x = 0 the coordinates are polar (z, x) with z^ = r + /? (r = 
Aor /i). The focal ellipse X = — —a in the yz plane of the general case, now 

degenerates into the focal circle of radius z = y/f3 — a. The line element is 
now 

ds‘^ = P^dX^ + Q^d[j? + z^d^ , 



where the components of the metric tensor are the same as in (5.84). 

If /? = Of, the coordinates (A, //, v) become prolate spheroidal coordinates 
(A,(/), z/). The surfaces with A const, are now prolate spheroids elongated 
in the direction of the z axis. The surfaces at iz = const, are hyperboloids 
of revolution with two sheets around the z axis. The interval in which y 
can range vanishes {y = — P = a), so that it can no longer be used as a 
coordinate. The surfaces with y = const, of the general case (hyperboloids 
with one sheet) are now planes that contain the z axis and that are therefore 
labelled by a constant value of the azimuthal angle 0. In every meridional 
plane at (f) — const, the variables (A, u) are elliptical coordinates, now defined 
as the roots for r of the equation 



w 



H 

T a r -f 7 



1 - 0 , 



2 2,2 

w = X P y , 



(5.89) 
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where —7 < v < —a < A. The foci are at tu = 0, 2: = ±^^7 — a and the 
relations between (A, v) and {vo^z) can be deduced from (5.82) and (5.83). 
In the equatorial plane the coordinates with 127^ = A + a, are polar 

coordinates. The line element is now 

ds^ = P^dX^ + w^dc/)^ + P?dii^ ^ 

where the components of the metric tensor P and R are given by expressions 
analogous to (5.84). 

If, finally, 7 = /? = o;, we are back with the spherical coordinates 
of (A. 8), with = X a. 

We now come to the question of the separability of the Hamilton- Jacobi 
equation. As already mentioned in Sect. 1.15 this problem is related both to 
the properties of the potential and to the coordinate system used. If we limit 
ourselves to orthogonal coordinate systems, that is, those given by a line ele- 
ment of type (A. 8), Stackel’s theorem (see Sect. 1.15) provides the necessary 
and sufficient conditions which must be satisfied at the same time by the po- 
tential and the coordinate system in order that the dynamical system under 
study be separable. From considerations which will be made in Sect. 5.8 
it turns out that some potentials of interest in astrophysics can satisfy with 
good approximation the conditions of Stackel’s theorem if ellipsoidal coordi- 
nates are used. 

Stackel’s Theorem. Let a Hamiltonian dynamical system be given with a 
Hamilton function^ ^ 

+ i- 1,2,3, 

where the kinetic energy is given by the last of equations (A. 9). The Hamilto- 
nian system is separable if and only if there exists a regular 3x3 matrix Wj , 
such that the elements on the i-th row are functions only of the coordinate , 
and a vector whose i-th component is a function of only the coordinate 
q'^^ such that 

Gpw, <? = y] Gpw\ (5.90) 

i i 

It can be observed that, if we denote by U'^j the inverse matrix of Wj, so 
that 

k 

then relations (5.90) imply that 

Gp^U\; ^ = (5.91) 



^°H. Goldstein: Classical Mechanics (Addison Wesley, 1980), Appendix D. 

^^The theorem can be generalized to N dimensions, but we will limit ourselves to 
the N = 3 dimensions of ordinary space. 
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Proof. Let us first show that conditions (5.90) are necessary for the separa- 
bility. Assume, therefore, that the system is separable, that is that the partial 
differential equation (which is the specialization to the present situation of 
general equation (1.C.90)) 

admits a complete integral of the form 

5* = y] 5*, 5* = s:(q\ ai, as, ^ 3 ). (5.93) 



By then substituting the complete integral (5.93) into (5.92), this equation 
is satisfied identically for every value of and ai in the domain of interest. 
Differentiating with respect to any ai we obtain 



Eg,-" 



dS* 

dq^ doi\dq^ 



= i> 



dS^ 

dq^ daadq^ 



0, a = 2, 3. (5.94) 



The coefficient of G- ^ in any one of equations (5.94) is a function of q'^ only, 
since is of the form (5.93) and the determinant of these coefficients is 



dS-^ dS^ dS^ 
dqi dq2 dqs 



dadq 



By choosing a set {ai,a 2 ^as^ such that V does not vanish, the algebraic 
system (5.94) takes just the required form of the first of (5.90) with 



, _ dS* d^S* 

^ dq^ dajdq^ ’ 

whereas (5.92) can be inverted to give 









-2 



7H 1 



which is just the form required by the second of (5.90). This completes the 
proof that conditions (5.90) are necessary for the separability. 

Let us now prove that (5.90) are sufifcient: in fact, they allow us to write 
the Hamilton- Jacobi equation (5.92) in the form 






-2 

i 



dq^ 



2 



2[Y,o^kU\-W' 



Kk=l 




where a 2 and are arbitrary constants. It is then clear that a principal 
function of the form given by (5.93) is a complete integral if Sf satisfy the 
equalities 
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StiQ') = f Tiiq^) = 2 - W 

\ k 

where the lower bound of the integration interval is chosen either as a constant 
or as a simple root of Ti. The solution of the dynamical problem is therefore 
reduced to quadratures by integrating 



t — to 



-Pa 



Pi 



dS* f U\dq^ 

dai y/fi{qi)' 


(5.95) 


dS* f U\dq^ 

daa Ey 


(5.96) 


dS* r—^ 


(5.97) 



Equations (5.96) determine the trajectory in the space (without taking 
account of the time dependence). Together with (5.95), they provide the 
solution of Lagrange’s problem. Equations (5.97) give the components of the 
momenta as functions of the coordinates. This completes the proof of Stackel’s 
theorem. 
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We have emphasized in various circumstances above that the Hamilton- 
Jacobi method (see Sect. 1.15) is the more powerful technique for solving 
problems in mechanics. For this reason and also with the aim of giving a uni- 
fied view of the subject, the study of orbits in fixed potentials will be made by 
exploiting, as far as possible, the separability of the Hamilton- Jacobi equa- 
tion. It is therefore obvious that Stackel’s theorem is an important result in 
this framework, since (5.95-97) give, at least formally, the complete solution 
to the problem (provided that the potential and the coordinate system obey 
suitable conditions). In Sect. 5.1 we also focussed our attention on the impor- 
tant role played by the isolating integrals, whose direct determination (when 
possible) allows a complete analysis of the motion. The aim of the present 
section is therefore the search for the second integral in a conservative system 
with two degrees of freedom, without the assumption that the conditions of 
Stackers theorem are satisfied. The result of the analysis is then that the 
existence of a second integral (quadratic in the momenta) implies that the 
potential and the coordinate system do satisfy Stackel conditions, which are 
then recovered in a completely independent way. 

In this section, at variance with the rest of the chapter, we study the 
motion of a point mass in the plane, where a two-dimensional potential 

^{x,y), {x,y}eM^ 
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is defined. The general problem of a Hamiltonian systems with two degrees 
of freedom is of fundamental importance by itself, but the following consid- 
erations demonstrate the usefulness of studying motion in two-dimensional 
potentials also for the help given in cases of interest in the three-dimensional 
world. First of all there are cases in which a symmetry of the potential al- 
lows us, in a suitable coordinate system, to reduce the number of degrees of 
freedom by means of the effective potential (an important example is that of 
rotational symmetry, which is examined in Sect. 5.8). A second point is the 
existence of planar orbits in every triaxial potential provided with refiection 
symmetry planes: their analysis, even if seldom possible analytically, sheds 
light on the general properties of spatial orbital motion. Finally, mass density 
distributions exist (for example the central regions of barred spiral galaxies) 
that give rise to gravitational potentials fairly well approximated by planar 
potentials. 

We do not have a general method for solving the equations of motion 
of a system with two degrees of freedom. This problem is related to the 
impossibility of verifying the existence of (and finding explicitly) a second 
isolating integral. Therefore, in what follows we consider prototypical cases in 
which it is possible to integrate the system in the sense of Liouville’s theorem 
(Sect. 1.19), postponing to Volume 2 the study of those cases in which the 
only viable route is that of perturbation theory. 

The prototype of integrable dynamical systems with two (or more) degrees 
of freedom is the anisotropic harmonic oscillator (see Sects. 1.3, 1.7, 1.16). In 
strict analogy, every potential of the form 

${x,y)=$i{x)+^2[y), (5.98) 



supports the two isolating integrals 
Ix == Px T 2^1 (x). 



h =Py +2^2(2/). 



(5.99) 



The motion of the system is given by the superposition of the librations along 
the two axes obtained as the solutions of the decoupled system of equations 



X + = 0 , 

ax 



d^2 

dy 



which is the generalization of (1.A.38), the equation of the two-dimensional 
harmonic oscillator. In agreement with Liouville’s theorem on integrable sys- 
tems, it is possible to decouple the two degrees of freedom in a general sys- 
tem if we are able to find a second isolating integral, namely a function 
hiPxiPy^x^y)^ which is constant along the trajectories of motion, that is 
/2 = 0 (obviously, the first isolating integral, /i, is the energy). A direct 
method for fulfilling the quest for the second integral is that of assuming a 
given functional form for I 2 and enforcing its conservation along the trajecto- 
ries by means of the condition expressed by (1.C.40) that the Poisson bracket 
with the Hamiltonian 
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^ (Px + pI) + y) = h, 



be zero, 



{i2,n) = o. 

In other words, the partial differential equation 

dh dh dh dH 

“q — P^ ~a~Py ~ Q 5 ^ 

ax ay opx ax 



dh dH 

dpy dy 



(5.100) 



must be satisfied. A general review of the results obtained by this method is 
that by Hietarinta:^^ let us examine here some simple cases with applications 
in galactic dynamics. 

We will limit ourselves to the case in which the second integral is a polyno- 
mial in the momenta. Let us first examine the very simple case of an integral 
linear in the momenta 



h = A{x, y)px + B{x, y)py + C{x, y). (5.101) 



We have to find what kind of potentials admit an integral of this form, ex- 
plicitly determining the functions A^B^C and The condition expressed 
by (5.100) must hold for any value of the phase space coordinates. Equating 
to zero the coefficients of the polynomial expression obtained by inserting 
(5.101) in (5.100), we find the following set of equations: 



— -0 — -0 —- — -0 

dx “ dy ~ dx~ dy~ 

dy ^ dx ’ dx ^ dy 



(5.102) 



The set of equations (5.102) has the following solutions: the function C is 
simply a constant, which can be put equal to zero without loss of generality. 
The functions A and B are given by 



A = ay B — —ax + c, 

where a^b and c are arbitrary constants (perhaps complex). With these ex- 
pressions for A and 5, the form of the second invariant can be written ex- 
plicitly and the last of equations (5.102) can be solved for There are two 
distinct cases: a — 0 (the uniform force field), for which we find that 

y) = f{cx - by), h = bpx + cpy, 

and a 7 ^ 0 (the central force field), with 

^{x, y) = f{x^ + y^), h = ypx - xpy, 

giving the conservation of angular momentum (/ is an arbitrary function of 
its argument). 

Hietarinta: Direct methods for the search of the second invariant, Phys. Rep. 
147, 87-154, (1987). 
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Let us move on to the case of the invariant quadratic in the momenta, 
namely 

I 2 = ApI + Bp^py + CpI + D, (5. 103) 

where A, B, C and D are functions of x and y. Proceeding exactly as in 
the linear case, equating to zero the coefficients of the polynomial expression 
obtained by inserting (5.103) in (5.100), we find the following set of equations: 



dA 

dx 

dD 



dC _ dAdB_ ^ 

dy dy ^ dx ’ dy ^ dx 



0 , 



^,d^ d$ dD d$ ^d$ 

dx~^^dx^^dy' dy~^dx^^dy’ 



The set of equations (5.104) has the following solutions: 

A — ay^ -\-hy^-c, B — —2axy — bx — dy+e^ C = ax^+dx+/, 



(5.104) 

(5.105) 



(5.106) 



where a^b^c^d^e and / are arbitrary constants (perhaps complex). Inserting 
these functions in set (5.105), we obtain the following integrability condition 
for D: 



{2axy + bx + dy + e) ^ 

- 2 (a{x‘^ - y^) + dx - by + f - c) 



“h 3(2dy T 5) 



dx 



— — - 3(2ax + d)— = 0. 
oxoy oy 



(5.107) 



This equation can be solved for all possible choices of the constants. Let us 
therefore examine the most interesting cases. 

1) a ^ 0, f b = d = e = 0. This case was solved by Darboux^^ by the 
method of characteristic curves (see, e.g., Courant and Hilbert^^) in the case 
for which a — 1 (simply equivalent to the general case a ^ 0, since I 2 contains 
a as a constant factor). To accord with the treatment of elliptical coordinates 
we define the new constants a — —f and jS — —c. For a partial differential 
equation with the structure of (5.107), the corresponding differential equation 
for the characteristics is 



xy (dy^ — dx^) + (x^ — y^ P — a) dxdy = 0. 

This equation, with the change of variables t is equivalent to 

Clairaut’s equation 



ds , ds f ds 

t— -s = (a-P)--(-- 
dt dt\ dt 



Darboux: Sur un probleme de mecanique, Archives Neerlaindaises (ii), 6, 371- 
377 (1901). 

Courant, D. Hilbert: Methods of Mathematical Physics (Interscience, New York, 
1953). 
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whose solution is given by the family of curves 

s(t: m) = mt — (a — p) 

where the arbitrary parameter m labels a single curve in the family. Returning 
to the Cartesian coordinates, we can represent the integral curves of the 
characteristic equation in implicit form by the expression 

(m + l)(mx^ — y^) = {a — P) m. 



By again changing the parameter of the family from m to r according to the 
relation 

ma + P 
^ 

m -h 1 

we are finally able to write the family of integral curves in the elegant and 
simple form 

T a T P 

in which we recognize (5.82a), and which leads us to the introduction of 
elliptical coordinates A and /i in the plane as the roots for r of (5.82a). We 
have then arrived at the interesting fact that the characteristic curves of 
(5.105) are in practice two families of confocal conics. The general theory of 
partial differential equations thus shows that, on passing to the characteristic 
variables A and /x, (5.107) can be cast in the canonical form 



^2^ 

dXdji 






(5.108) 



where A and M are functions of A and /x, which can be determined by ex- 
plicitly changing variables from Cartesian coordinates to elliptical ones by 
means of transformation rules (5.83a). However, this boring calculation can 
be avoided if we observe, following Darboux, that (5.108) possesses, as par- 
ticular solutions, the cases of the spherically symmetric potential and of the 
potentials which are already separated in Cartesian coordinates, that is, those 
given by (5.98). Let us consider, for example, the two simple cases 



— A-|-/x-|-q;-|-/3, 



^6 - X"" 



a — P 

(A -h Q^)(/X -h Q;) 



chosen for the straigthforward exploitation of the transformation of coordi- 
nates (5.83a). By substitution of and in equation (5.108) we obtain the 
following equations for A and M: 



A -h M — 0, (/X -|- o;)7l -|- (A “h q)]VI — 1, 



so that (5.108) becomes 
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By observing that (5.109) implies the condition 



we can quickly find its solution in the form 

A — fJj 



(5.110) 



where F{X) and G{ji) are arbitrary functions of their arguments. The pre- 
vious expression, displaying the form required to the potential, agrees with 
the expression of the potential in the statement of StackePs theorem (see 
(5.91)), as can be easily verified remembering that the elements of the metric 
for elliptical coordinates are given by (5.84). To find the explicit form of the 
function D we must integrate (5.105), with ^ given by solution (5.110). Ex- 
ploiting coordinate transformation rules (5.79), the transformation formulas 
for the gradient can be established: 



dX d 
dx dX 
dX d 



dfjL d 
dx dll 






d _ dX d dll d _ 2y 
dy dy dX~^ dy dy X — y 



(A -h o^) - 






Using these differential operators and expressions (5.106) for A and B, we 
find that one or another of (5.105) gives 

2 [(A + «4 - (M + «a|] J = [(A + ffl a _ c, 



(m + «a^J# 



and it is easy to verify that the function D, to satisfy the integrability con- 
dition, must assume the following form: 



D{X,li) 



uFiX) - AG(/i) 



The integral of motion (5.103) is therefore 

h = {y^ - P)pI + - a)pl - 2xyp^py + 2 



pF-XG 



Better still, by exploiting the transformation rules (A. 11) for momenta, to 
find the variables conjugate to the elliptical coordinates. 






aA 3A 
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we can express I 2 as a function of the canonical set of variables (A, 

h = ~ (5.111) 

A — fJj 

2) a = l, f = c, b = d = e — 0. This is not really a different case, since it is 
included in the previous general discussion of a 7^ 0, but it can be cast into 
a particular simple form by observing that it corresponds to the condition 
a = /?, which, on the basis of the treatment of elliptical coordinates in the 
preceding section (see the discussion following (5.84)), implies the choice, as 
coordinate system, of polar coordinates (r, (^). It is now possible to show, 
following a line completely analogous to the above, that the potential must 
be of the form 

${r,^) = F{r) + ^, (5.112a) 

which is known, in galactic dynamics, as the Eddington potential The corre- 
sponding integral of motion is 

■^2 == + 2G((/9), (5.112b) 

where p^ = which can be obtained from (5.9c) if we put, as a natural 
two-dimensional reduction of spherical coordinates, d = tt/ 2. In particular, 
in the case of axial symmetry (G ((/?) = 0), the fact that the integral (5.112b) 
is constant expresses the conservation of angular momentum, p^ = const. 

3) a = 0, d{ovb) 7^ 0, f ^ e = 0. Looking at (5.107), we see that this case 
the integrability condition is 

^^dxdy ^ ~ ~^J ~ ’ 



whose solution is 

^(x, y) = - [F{r + x) + G(r — x)l , (5.113a) 

r 

where F and G are arbitrary functions of their arguments. The second integral 
is of the form 

h ^ {yPx - xpy)py + x)G - {r - x)F ] . (5.113b) 



An example in this class was the resonant anisotropic oscillator (1.C.114). 

4) a = 0, f ^ c, b = d = e — 0. Looking at (5.107), we see that this case 
implies the condition 

^=0, 

dxdy 

whose solution is of the form given in (5.98). Therefore the integral is 
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l2=c{pl + 2^,)+f{pl + 2^2). 

To write the integrals in the form given by (5.99), it is sufficient to observe 

, 2fE-h r _h- 2 cE 
" f-c ’ '' f-c ' 

where the total energy E is the value assigned to the first integral Ii. 



5.6 The Problem of Two Fixed Centres in the Plane 



The problem of two fixed centres of attraction was already studied and rec- 
ognized as being separable by Euler in 1760 and was completely solved by 
Jacobi. It is the most renowned of the plane integrable problem (after, of 
course, the Kepler’s and the harmonic oscillator problems) and is an example 
well suited to illustrating the integration of the equations of motion by means 
of the method of the separation of the Hamilton- Jacobi equation in elliptical 
coordinates, in agreement with the technique presented in Sect. 5.4. 

We state the problem in the following manner:^^ we consider the motion 
of a body (mo) attracted, under the laws of Newtonian gravitation, by two 
fixed bodies of masses mi and m 2 , restricting the study to the case in which 
the motion is confined to a plane containing the two centres of attraction. 
Assuming that their Cartesian coordinates are (0, c) and (0, — c), and that 
those of mo are in general (x,^), the potential acting on mo will be 



^{x,y) = -G 



mi m2 

ri{x,y) r 2 {x,y) 



where the distances of the moving body from the centres are 
ri = yjx‘^ + (?/ - c)2, V 2 = \/x2 + (^ + c)2. 



(5.114) 



We attack the problem from the outset by passing to elliptical coordinates: 
the aim is to express the potential in the form of (5.91), as required in the 
statement of Stackel’s theorem, so that the solution of the dynamical problem 
is reduced to the integration of equations of the form of (5.95-97). 

The sum and the difference of the squares of the distances of the moving 
body from the centres are respectively 

rl+r\ = 2{x^ +y^ rl~rf=4cy, 



so that, recalling formulas (5.83) for the transformation between Cartesian 
and elliptical coordinates, the last equalities can be rewritten as 

A. Pars: A Treatise on Analytical Dynamics (Heinemann, London, 1965). 
^®This is clearly not a severe restriction since the problem is invariant with respect 
to rotation around the axis connecting the two centres: in the following section 
we will exploit this fact to study the full three-dimensional case. 
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^2 + ^1 — 2(A + /i + 2/3), 
rl - rl = (A + /3)(/x + ^), 



(5.115) 



with the definition 

a. (5.116) 

According to the discussion on the properties of elliptical coordinates follow- 
ing (5.84), it is easy to see that the position expressed by (5.116) implies 
putting the two attracting bodies at the foci of the coordinate system. De- 
composing the difference of the squares of the distances, we find that it is 
straightforward to obtain the simple relation 



^2 — \/ A -h /3 -h \//i -f /3, 
Vi — P — yj i^-\- P- 



(5.117) 



Putting these into expression (5.114), we see that the potential indeed has 
the form required by Stackel’s theorem: 

m^y/JTVp - m^y/XTP 

^(A,m) = 7 , 

A — jJi 

where the two new “masses” 



md = G{rri2 - mi), mg = G(m2 + mi) 



have been introduced. We assume, by definition, that m 2 > mi, so that we 
have the inequalities 

ms > md > 0. (5.118) 

Exploiting form (5.84) of the metric components in elliptical coordinates, the 
Hamilton- Jacobi equation 



n 





-h ^(A, /x) — a\ 



(which is equivalent, in two dimensions and in this coordinate system, to 
(5.92)) becomes 

2(A + a)(A + (5) + a)(M + /?) 

-hmd P - ms\/A -h P = (X- fi)E, 

where a\ — E, the total energy, is the first integration constant. If the com- 
plete integral (5.93) is written in the form 

5* = 5A(A) + 5^/i), 



(5.119) separates into the set of ordinary equations 




5.6 The Problem of Two Fixed Centres in the Plane 343 



2(A + a)(A + /?) 
+ a)(/i + f3) 



2 

= jE/(A + /?) + TTls y/ X P — I 2 -, 



2 

= E[fi /3) md v^/i + /? — I 2 



(5.120) 



where h is the second integration constant. The equations of motion, cor- 
responding to (5.95-97), general solutions according to the hypothesis of 
Stackers theorem, are now 



1 f y/XT~PdX 1 f \/fl + Pdfi 

2 J ^2(A + a)L(A) 2 J ^2(/i + a)M[/j.) 

1 j' dX f 

2 J y^2(A + a)(A + /?)L(A) 2 J -y/2(/x -[- o^)(/i + P)M(fi) 



(5.121) 

(5.122) 



Pa 



i^(A) 

2(A + ck) (A -[- /3) 



Pm = 



M(/i) 

2(/i + a)(/x + /?) 



(5.123) 



where have been introduced the two new functions 



L — P/(A -f- /3) + TTls y/ X f3 — I 21 

M = + /3) + a/p H" /? ~ -^2 5 



(5.124) 



which, if we look at the right-hand sides of set (5.120) and remember that 
the coordinates must satisfy inequalities (5.82b), are such that, during the 
motion, 

L > 0, M < 0. (5.125) 

It is instructive, at this point, to perform a qualitative analysis of the 
orbits in this problem. With this aim in mind it is sufficient to find the 
zeroes of the functions appearing in the denominators of (5.121-123) and to 
determine the intervals that the coordinates fall in. To facilitate this study it 
is convenient to make the change of variables 

+ i=y/iT+p, (5.126) 

such that the inequalities mentioned turn out to be 

^ > c, c> L> —c. (5.127) 



Definitions (5.124) simply become^^ 



^^The choice of the negative sign in front of I 2 is made with the intent that it 
coincide with integral (5.111) of the previous section. 

^^Recalling the relations written after (5.87), we see that the coordinates (^, ^) are 
related to the coordinates (u,u), generated by the conformal transformation of 
(5.86), by ^ = Zicoshu, l — Z\sinu. 
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L = + nisi — /2, 

M — Ei^ + m^L — I 2 . 

Let us denote by ii, £2 the zeroes of L, and by ^ 1 , L 2 the zeroes of M; ^ 1 , £2 
are certainly real numbers. If, in fact, they were complex, the coefficients of 
the trinomial L{£) should be such that 

rUg < -AEI2, 

and accordingly, as a result of (5.118), the coefficients of M{t) should be such 
that 

ml < -AEI2, 

so that ^ 1 , i 2 too would be complex. But this is impossible, since if both L 
and M had complex zero, they would both have the same sign as E, and so 
one of inequalities (5.125) would be violated. The condition that ^ 1 , £2 be 
real can be used as the basis for the orbit classification, instead of using, for 
example, the values of the constants of the motion E^ I 2 . 

We limit ourselves to the case E < 0 {bound orbits). Inequalities (5.125) 
then imply that, since the first coefficients of the trinomial are negative, £ 
must be inside the interval (£ 1 , £ 2 ), whereas t must be outside the interval 
(^ 1 , 12 ) (in the case for which li and L 2 are real). Let us therefore look for 
the critical curves in the £i £2 plane, assuming, by definition, that £i > £2 
always. This assumption and the fact that the sum of the zeroes 

7T7 

(5.128) 

must be positive for the bound orbits, eliminate, in the £i £2 plane, the whole 
region lying on the left of the two straight lines £2 ^ £i == 0 (see Fig. 5.3). 
Putting (5.128) and the equation giving the product of the zeroes 

4^2 = ^ (5.129) 

into the equation M{l) = 0, we find that ci and L 2 are the roots of the 
equation 

{£i -p £2)x + £i£2 = 0. 

rus 

The condition ti — 12 is satisfied if 

{ii + ^ 2 )^ = 4£i£2{ms/md)^. 

This, if we introduce the variable 0, such that 

— = cosh e, —= tanh^ -9, 

md m 2 2 



£j_ 

£2 



becomes 



,±20 



(5.130) 
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representing a pair of straight lines with the same slope with respect to the 
axes of the £2 plane (denoted by ti = i 2 in Fig. 5.3). 

Other critical curves, useful for orbit classification, are given by the con- 
ditions Li — c or i 2 — c, which, proceeding as before, correspond to the 
equation 

c2 - —(^1 + £ 2 )^ + £\£2 = 0, 
rus 

that is, to the hyperbola of equation 

{£i — cmd/ms)(^2 — cmd/rris) = —c^ tanh^ 9. (5.131) 

The tangents to this hyperbola are the straight lines (5.130); in particular, 
the one with the positive exponent touches the lower branch of hyperbola 
(5.131) at the point 

T= (ce^ ce~^). 

To determine which part of (5.158) corresponds to /.i = c and which to L 2 = c, 
we assume first of all that ti > 12 - Now, if L 2 = c, then + ^2 ^ 2c implies 
the inequality 

-^1 +^2 ^ — 2c cosh 0, 

so that i 2 is equal to c on the right of T. An analogous argument proves that 
— c or ^2 correspond to parts of the lower branch of the hyperbola 

[£i + cmd/ms)(^2 + cmd/rus) = -c^ tanh^ 6. (5.132) 

Both hyperbolas (5.131) and (5.132) pass through the point P = (c, — c). 
This completes the determination of the critical curves, although it remains 
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to eliminate some other regions in the ^2 plane. From the first of equations 

(5.127) we see that the semi-plane < c must be removed, since, by (5.125), 
£ must lie inside the interval (^i, ^ 2 )- Moreover, from the second of equations 

(5.127) , and since, from what we said above, in the case of real zeroes for the 
trinomial M(^), l must lie outside the interval (^ 1 ,^ 2 ), we must exclude the 
region for which ii > c> —c > L 2 . Once the regions not consistent with all 
the above conditions have been removed, there remain, for the bound orbits, 
the regions numbered from 1 to 4 in Fig. 5.3, representing four different types 
of orbits. In region 1 we have £i > £2 > c, so that the £ motion is given by a 
libration between £ = £i and £ = £ 2 . In regions 2, 3 and 4 we have £2 < c, so 
that the £ motion is given by a libration between £ — £i and £ — c.ln regions 
1 and 2 the zeroes of M are either complex or real but with ti > L 2 > c, so 
that L can take any value in the range between — c and c. In region 3 we have 

0 Li > L2 > — c, 

so that there are two possibilities for the i motion, either a libration between 
c and or a libration between t 2 and — c. In region 4 we have 

Li > 0 L2 > — c, 

so that the only possibility for the l motion is a libration between L 2 and — c. 
To summarize, in region 1 the orbit is then a curve (in general not closed) 
which, after sufficient time, densely fills up the oval annular ring contained 
between the ellipses £ = £\ and £ = £2 (see Fig. 5.4). In region 2 the orbit is 
figure- eight shaped around both centres and, after enough time, densely fills 
up the ellipse £ ^ £i (see Fig. 5.5). In region 3, the moving body becomes the 
satellite of one or the other of the fixed masses (see Fig. 5.6), since, shifting 
from parameters of the region 2 to those of region 3, the figure- eight has 
broken into two distinct curves. Shifting then from region 3 to 4, we find that 
one of them disappears and the moving body can only be a satellite of mi . 





/ \ 




Fig. 5.5 



Having worked out the qualitative analysis, we can move on to the explicit 
integration of the orbits. We limit ourselves to a specific example: the case 
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of the elliptical annulus of Fig. 5.4. From (5.122) we see that the trajectory 
is determined by the equation 



dX dfi 

>/(A -f a){\ + (3)L{X) \/(m T T f3)M{fi) 



(5.133) 



The relation between momenta and velocities (A. 11) and the form of the 
metric coefficients (5.84) lead to a parametrization of the trajectory in the 
form A = A(r), // = /^(t), where the new parameter r is introduced by means 



V2{x-,,y 

However, the coordinates l are more useful also for the integration of the 
equations of motion. In fact, the functions L{\) and M[fi) are irrational, 
whereas, expressing (5.133) in terms of the coordinates we find that 



d£ 

^{£i-e)i£-e2)ii-c){i + c) 



di 

\/{c- l){l + c){ii - l){l 2 - l) ’ 



(5.134) 



where the intervals for the variables are those found before. Equating both 
sides of (5.134) to 2 dr, where now, from (5.126), 



, dt 

dr = 

\/2(£2 - 

and using Jacobi elliptical functions, we find that we can parametrize the 
orbits in the form £ = £{r)^ i — One method is the following: by means 
of the change of variable 

£ — £2 £2 — c 

£\ — £ £\ — cl — 



the left-hand side of (5.134) takes the form 



2 dx 

\/{h - c ){£2 + c ) y(l -x2)(l -Bx^y 
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where 



e = 



2c(£i - £ 2 ) 



{ii -c){£2 + c)' 
Introducing the Legendre elliptic function of the first kind^^ 

d^ 



(5.135) 



/ 



where u is related to the parameter r by the relation 



U = V(£i -c)(£2 + c) r, 

we know that the functions snu and cnu, called, respectively, the Jacobi 
sine-amplitude and cosine-amplitude functions, result implicitly defined by 



X = snu, \/l — x^ = \/l — sn^u = cn u. 

Therefore, the solution for £ can be expressed in the form 

£i (£2 - c)sn^n + - c)cn^n 

^ ^ (£2 — c)sn2zt + (£1 — c)cn^u 

Analogously, if we use the change of variable 

L-\-c ii-\- c 



(5.136) 



c — i ^ 1 — cl — 2 /^’ 

the right hand side of (5.134) takes the form 

2 dy 



where 



y(ti - c)(<,2 + c) \/(i - 2/^)(i - 
2 _ 2 c(ti - 12) 

K — 



(tl -C)(i2 + C)' 

Therefore, following the same procedure as above, 

y = snv, u = (t - To) 

and 



(5.137) 






(^1 + c)sn^t; — (ti — c)cn^v 



(5.138) 



(ti -h c)sn^t; -h (^^1 — c)cn‘^v 

The orbit is now parametrized in the form £ = £(r), l = l{t). The parameter 
for the Jacobi functions appearing in (5.136) is defined by (5.135), whereas 
the parameter of the functions appearing in (5.138) is defined in (5.137). 



^^See, for instance, E. T. Whittaker, G. N. Watson: A Course of Modem Analysis 
(Cambridge University Press, 1927), pp. 512 ff. 
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5.7 Axially Symmetric Potentials — 
Motion in the Potential of the Earth 



We now return to the study of fully three-dimensional orbits, considering 
potentials of the kind ^ where, having introduced the cylindri- 

cal coordinates defined in (A. 8a), we assume that the potential does not 
depend on the azimuthal angle (p and is endowed with rotational symmetry 
around the 2 : axis. This case is of paramount importance for astrophysical ap- 
plications, since potentials of this form are generated by mass distributions 
provided with rotational motions around the symmetry axis, typical of solid 
celestial bodies, stars, disk galaxies and also common, if not dominating, in 
elliptical galaxies. 

A glance at the equations of motion (A. 17), suggests from the outset some 
of their peculiarities. The angular momentum around the z axis 



L 2; — Pip — 



(5.139) 



is constant, since the corresponding conjugate variable is ignorable. Equation 
(A. 17) then reduces to 

w — — 






dw' 






(5.140) 



which are the equations of the motion in a symmetry plane rotating with 
angular velocity given by = Lzjvo^ {meridional plane) and can also be in- 
terpreted as the equations of motion associated with a new effective potential 



^eff = ^ + 



A 



(5.141) 



Since the problem is in this way reduced to two degrees of freedom, the motion 
in the meridional plane is a problem to which the general methods illustrated 
in the previous sections apply. Let us assume then that the reduced problem is 
separable and that, in addition to the energy there exist a second integral 
I 2 . Choosing a suitable coordinate system, for example the prolate spheroidal 
coordinates (A, (p^ u) defined by equations (5.89), given the values of I 2 
and Lz, we can find the area in the meridional plane allowed to the orbit. 
This area, say A, can be found, as usual, cutting the level curves of the 
effective potential (5.141) with the line E constant. The axially symmetric 
volume obtained by making A rotate around the symmetry axis defines the 
orbit in the space, because the orbit densely fills all the allowed volume the 
frequencies of the motions in A, z/ and ip being in general incommensurable. 
Once the motion in A and z/ has been determined, that in (p follows from 
(5.139) written in the form 
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A particularly interesting example of motion in a spheroidal potential is that 
of motion in the potential of the Earth, which finds a useful application in 
the determination of the orbits of artificial satellites. 

In general, if we have a finite mass distribution endowed with rotational 
symmetry, the external potential can be written as the series expansion 



GM 

r 



l + '^Ak 

k =2 



R 

r 



k 

Pk{sinb) , 



(5.142) 



where M is the total mass, R is the equatorial radius (defined by b = tt/2, 
where b is the latitude), r is the distance from the centre of mass, where the 
origin of the coordinate system is set, Pk are the Legendre polynomials and 
Ak are the coefficients of the expansion. In the case of the Earth, from studies 
of the motion of satellites, it has been found that the first three coefficients 
are 



A 2 - -1.0826 X 10■^ As = 2.5 x 10~^ A 4 = 1.6 x 10“^ (5.143) 

Since in the case of spheroidal symmetry the odd terms are missing in expan- 
sion (5.142), we see that the Earth has a non- negligible pear-shaped distortion. 
It is clear that we cannot hope to solve the problem of the motion in the po- 
tential (5.142) exactly if the order of the expansion is arbitrarily high, but, 
if it is truncated at a sufficiently low order, it can be solved by an ingenious 
trick. Let us again take the two fixed centres problem of the previous section 
and suppose that the two masses mi and m 2 are put at the points z — C\ 
and 2 ; — — C 2 - We now want to study the three-dimensional problem, so that 
we write the potential in the form 



^{x,y,z) = -G 
= -G 



mi 






m 2 



r\{x,y,z) r 2 {x,y,z) 

mi 



+ 



m 2 



-h 2/2 + (2: - CiY -h 2/^ -h (2; -h C2Y ) 

Exploiting the series expansions 



r 

T2 



Pn{Anh), ^ = £( 7 ) Pn{Anh), 



n=0 



n=0 



the potential can be written as 

G{mi+m2) 






k = l 



(5.144) 



where the coefficients 7 ^ are defined by 

miCi -f m2(-C2)^ 
Ik = • 



mi -h m2 



(5.145) 
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In view of the resemblance of (5.142) and (5.144), the procedure is ap- 
parently straightforward: if we succeed in identifying the coefficients 'jk with 
AkR'", we have a complete coincidence of the two cases and therefore the 
problem of the motion of the satellite is solved, since it has been reduced to 
an integrable problem. Unfortunately, in expression (5.145), there are only 
four really free parameters which we can use to make 7 ^, and not infinitely 
many as Ak. We can therefore realize the coincidence only up to four terms 
in the expansion: if, as is obvious, we choose the first four terms, we have the 
relations 

mi + m 2 = M, 

mici - m2C2 = 0, 

2 , 2 A/r A ( 5 . 146 ) 

m\Ci + m2C2 = MA2R , 

micl — m2cl = MAsR^, 

which can be solved for c. 




i- _i i_ 

(5.147) 

and for m, 

mi = M — — — , m 2 = M — — — . (5.148) 

Cl +C2 Cl + C2 

Recalling the numerical values of A 2 and A 3 , we see that in the argument 
of the square roots of (5.147), the negative term 4 A 2 dominates by three 
orders of magnitude: the distances Ci and C 2 (and therefore also the masses 
mi and m 2 ) turn out to be complex numhers\ This result, surprising at first 
glance, need not worry us: what matters is that the potential is real and thus 
the related motion. We thus arrive at the conclusion that the motion of a 
satellite in the field of the Earth is reduced to Euler’s classic problem, with 
complex masses put at complex distances (real masses at real distances would 
give a prolate spheroid instead of an oblate one). If, moreover, in (5.147) and 
(5.148), A 3 is neglected, that is, not taking into account the asymmetry of the 
two terrestrial hemispheres with respect to the equatorial plane, we obtain 



mi=m 2 = -M, ci=ic^ iR\/\M\, C 2 = -ic = -iRy/\M\, 



so that the problem becomes simply 



1 1 

-\- {z — ic)^ -h (z + ic)^ 

Without entering into the details, we can summarize the results obtained 
as follows. There are many autonomous dynamical systems (of very real inter- 
est in celestial mechanics or in other fields of analytical mechanics) in which 
the existence of non-classical isolating integrals, such as 



z) 



GM 
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^ A — /i 

of the two fixed centres problem, is suggested by several peculiarities dis- 
played by the orbits of the system, but of which it is impossible to find the 
explicit form. 

Still in the class of the potentials we are examining in this section, the 
most interesting case is certainly that of the potential of our Galaxy, which 
is imagined to be a rotationally symmetrical mass distribution, very roughly 
represented as a thin disc embedded in a huge spheroidal halo. Without going 
into all the details, we simply wish to recall that, from the classical works 
by Kapteyn, Jeans and Oort,^^ the study of the kinematics of stellar motion 
in the neighbourhood of the Sun and the first attempts to construct equilib- 
rium models for the Galaxy provided compelling evidence for the existence 
of a non-classical integral of the motion (the celebrated problem of the third 
integral in galactic dynamics): in particular, the triaxial anisotropy of the 
velocity dispersion of the stars in the disc cannot be described by galactic 
models constructed on the basis of distribution functions that depend only 
on the energy and the angular momentum. The potentials we have exam- 
ined (the Stackel cases, essentially) do not allow great fiexibility for careful 
modelling of the galactic disc. Nevertheless, they can be considered to be the 
paradigm of the dynamical structure of the galaxies. To deal specifically with 
some real cases, we need to shift to perturbative techniques, which we will 
study in Volume 2. 



5.8 Orbits in Triaxial Potentials 

When studying motion in potentials endowed with axial symmetry, we men- 
tioned some of the peculiarities displayed by the orbits supported by them. 
It is clear that, if we remove the axial symmetry, but maintain the symmetry 
of refiection with respect to every coordinate plane and remain within the 
conditions in which the Hamilton- Jacobi equation is separable, the charac- 
teristics of the orbits are even more varied and interesting: we dedicate this 
section to the analysis of motion in a generic triaxial Stackel potential, that 
is, a potential of the form introduced in the statement of Stackel’s theorem 
in the second of equations (5.90). Other aspects of the orbits in the axially 
symmetric case will emerge as limiting cases, without the necessity of boring 
recapitulations. The general expression for the potential is 

^®J. C. Kapteyn: First attempt at a theory of the arrangement and the motion of 
the sidereal system, Astrophys. J. 55, 302-327 (1922); J. H. Jeans: Astronomy and 
Cosmogony (Cambridge University Press, 1928); J. H. Oort: Stellar dynamics, in 
Galactic Structure, ed. by A. Blaauw, M. Schmidt (University of Chicago Press, 
1965), pp. 455-530. 
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FW ■ 

(A - /i)(A - u) 



F{^i) 



Fji^) 

{v-x){i'- nY 



(5.149) 



where, for the functions in the numerators, we have used, for simplicity, the 
same symbol since, from (5.77), the three variables A, /i, v each ranges 
over its own interval, and these intervals do not overlap and cover all the 
real axis from —7 to + 00 . We can also use the notation F = F{r)^ with 
r representing, in turn, the three coordinates A, /x, v. For the solution of 
the Hamilton- Jacobi equation, the definition of the isolating integrals and 
the qualitative analysis of the orbits, we will follow closely the treatment by 
de Zeeuw,^^ whose elegant and exhaustive work can be considered, within 
reason, the standard reference for this problem. 

The Hamiltonian is 



\ {P-Vx + Q~V^ + R~V.) + ^(A, /i, = E, (5.150) 

where P, Q and R are given by (5.81). Putting pr — dS*/dr, r = X, fi, u 
into (5.150), we obtain the Hamilton- Jacobi equation. With the potential of 
(5.149), after multiplying it by (A — /x)(/x — — A), we find 



in-u) 






2(A + a)(A + /?)(A + 7)(^j -F(X)-X^E 

/ /) cat \ 2 

2{n + a){ii + + - F{n) - ^YE 



+ (A — m) 



2{i/ + a)(i' + (5){v + 7 ) 



dS* 

du 



2 

- F{u) - u^E 



= 0 . 



(5.151) 



Assuming, as usual, that the solution is of the form 
5 *(A,m,i.) = 5a (A) + 5^(/x) + 
and defining the function 

P(r) = 2(r + a)(r + /3)(r + 7 ) -F{r)-r‘^E, r ^ X, ji, z/, (5.152) 



we see that the Hamilton- Jacobi equation (5.151) then simply becomes 

(/i - v)U{X) + (z/ - A)P(/i) + (A - p)U{v) = 0. 

This equation must be satisfied for every value of A, /x and v. Its partial 
derivatives with respect to each coordinate imply that U'{r) is a constant, 
so that 

U{t) = 12T - is, 

^^de Zeeuw: loc. cit., Footnote 17, Sect. 5.4. 
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where Z 2 and 23 are two constants. Putting this into the Hamilton- Jacobi 
equation, we arrive at the equations of motion: 



dSr _ _ r‘^E - T22 + 23 + Fi'^) 

dr y 2 (r -h a)(r + / 3 )(r -f 7) ’ 



(5.153) 



which is the realization of (5.97) in the present case. 

The constants 22 and 23 appearing in the equations of motion are the 
values of the two isolating integrals, I 2 and 73 , admitted by the problem in 
addition to the energy H — E. Their explicit expressions in terms of phase 
space coordinates can be found in the same way that we found I 2 in the 
axially symmetric case before: we solve (5.153) for 22 and 23 and substitute 
in place of E the explicit form of Hamiltonian (5.150). The easiest way to 
write the ensuing expressions is, however, by means of the quantities X, Y 
and Z defined by 

y _ pI m 

2P2 (A-^)(A-I/)’ 

y pI m 

^ pI F{v) 

2 B? {u — X){iy — 11)’ 

With a little algebra, we then find the following relations giving the isolating 
integrals: 



(5.154a) 

(5.154b) 

(5.154c) 



+ Y + (5.155a) 

-^2 — (a^ “i" {ly X)Y + (A + {Y)Z^ (5.155b) 

Is = fiyX + vXY + XfiZ. (5.155c) 



We know that every function of 77, 72 , Is is again an integral of the motion. 
The freedom of choosing three arbitrary independent functions of 77, I 2 and 
Is can be exploited to simplify the qualitative analysis of the motion. To this 
end it is convenient to keep the energy 77 = E" as an integral, but, instead of 
I 2 and 73 , it is more useful to introduce the quantities J and K defined by 

J ^ ^H + ah + h^ ^ ^ (5 

a — 7 7 — a 

Among other things, with these functions it is easier to establish the links 
between the triaxial case and its axial and spherical limits. In terms of the 
values of j and k of the integrals J and K, the equations of motion (5.153) 
become 



dSr _ _ j (r + q)(t + -f)E - (t + 7)j - (r + a)k + F{t) 

dr y 2(r + a)(r + /?)(r + 7) 




5.8 Orbits in Triaxial Potentials 355 



For the qualitative analysis of the bound {E < 0) orbits, by analogy with 
what was said in Sect. 5.7, we have to find the conditions such that, for given 
values j and k of the integrals, the three quantities and are not 

all negative. As a consequence, the allowed intervals for the coordinates A, 
/i and u will be determined and the motion will result as the combination 
of librations between the inversion points = 0 and, if p^ > 0 for every r, 
rotations. According to the type of combination of librations and rotations, 
a specific family of orbits will ensue. In particular, the intervals delimit a 
volume in physical space, with different shapes for the various families. Since 
in general the frequencies of the motion in each of the three coordinates are 
not commensurable, the orbits densely fill the allowed volume. A complete 
classification can be performed determining, by means of (5.157), for every 
combination of E^ j and /c, whether an orbit exists, and if so to what family 
it belongs. To this end it is convenient to rewrite (5.157) in the form 



E - j/{r a) - k/{r ^ G{r) 
2(r + /3) 



(5.158) 



where the function G{r) is defined as 



G(r) = 



F{r) 

(r + a)(r + 7) ' 



Rearranging its form, we can cast (5.158) in the enlightening form 



E = 2{T + (3)pl + Ves{T), (5.159) 



where we have introduced the effective potential 



Kff(r) 



J 

r + a 



k 

_| 

r + 7 



G{r). 



(5.160) 



Equation (5.159) has the familiar form of the energy equation for one- 
dimensional motion in the effective potential (5.160). It differs from the stan- 
dard form in one important respect: the usual condition, for which the motion 
is permitted in the regions in which the energy is greater than the effective 
potential, is true for A and //, but not for z/, as can easily be deduced by 
considering that the factor multiplying p^ would be negative in such a case. 
Summarizing, we can say that p^ is positive (and therefore the motion is 
possible) if z/, /i and A satisfy the inequalities 



E < Eeff(z/), E > 14ff(/x), E > Eeff(A). (5.161) 



The procedure is therefore to plot, given the values of J and AT, the curve of 
the effective potential. With this in mind, it is necessary to assume a form 
for G(r): since our main interest is in self-gravitating systems, it is sensible 
to assume that G(r) is a finite positive function, monotonically decreasing 
over the whole range —7 < r < + 00 . Comparison with the line E = const. 
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taking into account conditions (5.161), gives directly the intervals available 
to the motion. A glance at the plots in Fig. 5.7 shows that the orbits can exist 
only in the case A: > 0, but since we have to distinguish between the cases 
E and E < Kff(— /?), there are four possible combinations giving 

rise to four different types of orbits. 

a) Boxes. In the cases 

j < 0, A: > 0, Kff(-/?) < ^ < 0, 

from Fig. 5.7a, it is easy to see that the allowed intervals for the coordinates 
are 

^ ^ ^max5 ^ ^ /^max5 — O < A < A^iax* 

Recalling the properties of ellipsoidal coordinates (Sect. 5.4), we see that the 
projection of the orbit on the xz plane must always be inside the area bounded 
by the focal hyperbola (5.80b), whereas the projection on the yz plane must 
always be inside the area bounded by the focal ellipse of (5.80a) (see Fig. 5.8). 
The motion is given by the superposition of three librations in each coordinate 
so that, following Schwarzschild,^^ we call this type of orbit a box-orbit 




-r-P -a (c) 




-r-p -a (d) 




Fig. 5.7 



Schwarzschild: A numerical model for a triaxial stellar system in dynamical 
equilibrium, Astrophys. J. 232, 236-247 (1979) 
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Fig. 5.8 



b) Inner Long Axis Tubes. In the cases 

j < 0, A: > 0, < E < I4ff(-/5), 

where /io is the value of fi for which has a minimum, from Fig. 5.7b we 
see that the allowed intervals for the coordinates are 

T — ^ /^min ^ ^ Mmax? ^ ^ ^ ^ -^max’ 

The orbit is always around the x axis. Its projection on the xz plane must 
be outside the area bounded by the focal hyperbola (5.80b), whereas the 
projection on the yz plane must always be inside the area bounded by the 
focal ellipse (5.80a) (see Fig. 5.9). The motion in z/ is a rotation and we call 
it a tube orbit around the x axis. Since it crosses the 2 : axis between the inner 
foci (those of the focal hyperbola) at z = ibv'y - (3 and the outer foci (those 
of the focal ellipse) at z = zb a/7 — a, we call it an inner long axis tube. 

c) Outer Long Axis Tubes. In the cases 

j > 0, /c > 0, Kff(Ao) < E < Vq^{-(5), 

where Aq is the value of A for which has a minimum, from Fig. 5.7c we 
see that the allowed intervals for the coordinates are 

7 — ^ /^min ^ ^ '^min ^ '^max* 

The orbit is always around the x axis (see Fig. 5.10). Its projection on the xz 
plane must again be outside the area bounded by the focal hyperbola (5.80b) 
and the projection on the yz plane must always be outside the area bounded 
by the focal ellipse (5.80a). The motion in v is again a rotation around the x 
axis, and since it crosses the z axis outside the outer focf we call this kind 
of orbit an outer long axis tube. 
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Fig. 5.9 



d) Short Axis Tubes. In the cases 

j > 0, A: > 0, Kff(/?) (or Kff(Ao)) < ^ < 0, 

where Aq is the value of A for which Vqh has a minimum, from Fig. 5.7d, we 
see that the allowed intervals for the coordinates are 

T — ^ — ^max5 ^ ^ O!, Amin ^ A ^ Amax* 

The orbit is always outside and around the z axis (see Fig. 5.11). Its projection 
on the xz plane must be inside an area bounded by the focal hyperbola (5.80b) 
and the projection on the yz plane must be always outside the area bounded 
by the focal ellipse (5.80a). The motion in ji is now a rotation around the z 
axis and we call this kind of orbit a short axis tube. 

In summary, the point to emphasize about the above results is that the 
“zoology” of orbits we have illustrated above is highly representative of the 
orbits we observe in triaxial potentials considered as realistic approximations 
of elliptical galaxies. 




Fig. 5.10 
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Fig. 5.11 



5.9 Configurational Invariants 

Up to now all that we have done in the quest for integrability^^ is intended to 
be valid for all the values of the energy of the system. This point is implicit in 
the general condition for the existence of first integrals /(p, g), namely that 
their Poisson bracket with the Hamiltonian vanishes, 

(/,H)-0. 

A function on phase space is a first integral if and only if this condition is 
satisfied and this is true for every value of the energy. We can therefore speak 
of integrahility at an arbitrary energy. 

Let us see how things change, with respect to what was discussed in 
Sect. 5.5 regarding integr ability at arbitrary energies, if we search for inte- 
grability at a fixed energy in the case of the classic two-dimensional system. 
For the sake of simplicity we consider only the case of the second invariant 
linear in the momenta, but we soon enforce the condition 

\ {pI + pI) + y) = Eo, (5.162) 

and use it consistently from now on. This is accomplished by eliminating one 
of the momenta; we choose Py, by means of relation (5.162) written in the 
form 

p1^2{Eo-^)-pI (5.163) 

The second invariant of the form (5.101), with the expression of the conser- 
vation of energy explicitly included, is now written 

h = A{x,y)Px + B{x,y)y/2{Eo - «?) - pi- (5.164) 

®®See M. Tabor: Chaos and Integrahility in Nonlinear Dynamics, (Wiley, New York, 
1989), p.322. 
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Computing its Poisson parenthesis with namely the condition that it be 
identically zero for every value of and Px-, we arrive at the following 
relations: 



dA dB 



dx dy 




AB 



dA dB „ 

— 0, h — 0, 

ay ox 


(5.165) 


, ,dB 






(5.166) 



It is clear that this set of equations allows for a larger class of potentials if 
compared with (5.102). It is important to observe that if (5.166) were required 
to hold for every value of the energy, this would imply that dB/dy = 0, so 
that the system of (5.165) and (5.166) would reduce to system (5.102). This 
simple but important point further clarifies the issue of integrability at fixed 
energy. In other words, this means that (5.165-166) are the necessary and 
sufficient conditions for the existence of an integral at fixed energy, but they 
are only necessary (and not sufficient) for the existence of an integral at 
arbitrary energies. 

A simple example which is general enough to shed light on the whole 
matter is the following. Let us choose as a simple solution of (5.165) that 
with A — X and B = y, so that the second integral is 



h = xpx + ypy. 



(5.167) 



For definiteness let us fix the energy at the value Eq = 0, so that, for (5.166), 
the potential must satisfy the equation 






+ 2<i> = 0. 



The solution of this equation is 



^{x,y) = x ‘^f{y/x), 



with / an arbitrary function, so that it is straightforward to verify that 



/2 = (/2,H) = 2F;o, 



(5.168) 



where T-L is taken to assume arbitrary values. This is direct verification that I 2 
is a first integral only on the fixed energy surface E'o = 0. An expression like 
(5.168) can be seen as the condition for a weak involution property which is 
reminiscent of the more general situation of Liouville’s theorem for systems 
with a number of integrals in strong involution equal to the number of degrees 
of freedom (see Sect. 1.18). I 2 is said to be a configurational invariant 

^^This definition was introduced by W. Sarlet, P. G. L. Leach, F. Cantrijn: First inte- 
grals versus configurational invariants and a weak form of complete integrability, 
Physica 17D, 87-98 (1985). 
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It is very interesting to develop the treatment of configurational invariants 
in a geometric framework, instead of the analytical approach followed before. 
The appeal of the geometric picture is twofold: first, in the simplest cases the 
conditions for the existence of the invariants are related to the symmetries 
of the dynamical problem through Noether’s theorem (see Sect. 1.6); second, 
if we represent the trajectories of the system at a fixed energy Eq as the 
geodesics on the manifold endowed with the Jacobi metric, the Killing equa- 
tions (1.B.41) and (1.B.42) are directly applicable and show all their power. 
If we exploit the Jacobi form of mechanics (see Sect. 1.8), we can obtain the 
equations of motion for a conservative dynamical system from the Lagrangian 

^ = ^gikQiqk, 9ik = - ^). (5.169) 

Requiring, for example, the invariance of the action integral with respect to 
a transformation of the form 



9k = qk^ik{q)e, 



following the general analysis developed in Sect. 1.6, we find that the invari- 
ance is guaranteed by the condition 



dqi dqi dqj 



(5.170) 



With a Lagrangian of the “kinetic” form of (5.169) it is easy to verify that 
(5.170) can be written in the standard form of the Killing equation: 



^9ik 

dqj 






dQ 



dqi 



= 0 , 



(5.171) 



which is well-known in Riemannian geometry, and that the invariant can be 
written in the form 

I = Qijijqk = Pk^k, (5.172) 

where we have used the duality relation between the velocity and momentum. 
We therefore see that, in the case of a symmetry related to the existence of the 
Killing vector the invariant linear in the momentum (and so of the form 
of (5.101) or (5.164)) emerges in a straightforward way. It is then natural 
to think that the conditions imposed by the Killing equations (5.171) are 
equivalent to system (5.102) (or (5.165-166)). In fact, with the Jacobi metric 
of (5.169), Killing equation (5.171) gives the set of conditions 



2(^0 - ^)^{^,k) - V^k^AJ = 0 . ( 5 . 173 ) 

where we have used the simplified notation of commas to denote partial 
derivatives and 



= ^i,k + ^k,i^ 
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Consider again the case of two degrees of freedom. If we want conditions 
(5.173) to be satisfied for every value of the energy, we find that 

0 . == 0 . ( 5 . 174 ) 

which, by the way, are valid for any number of degrees of freedom. But if we 
require that conditions (5.173) are satisfied on the fixed J^o surface, we find 
that 



Cl, 2 + ^2,1 = 0 , Cl,l = C2,2, (^0 - ^)C2,2 - = 0 . (5.175) 

Defining Ci = ^ and C 2 = B, so that (5.172) agrees explicitly with (5.101), we 
see that (5.174) coincides with the general case (5.102), whereas (5.175) are 
the same as (5.165-166). The geometric interpretation of this situation is that 
the “family” of surfaces endowed with the Jacobi metric, at some particular 
value of £" 0 , “takes a shape” that allows for the existence of a Killing vector 
(which is the generator of a transformation symmetry of the surface) to which 
a configurational invariant linear in the momentum is associated. 

It is also possible to generalize this geometrical approach to higher-order 
invariants. If the second-order invariant defined by (5.103) is written in the 
form 

I = KijPiPj D, 

where Kij is a symmetrical second-rank matrix, it is easy to see that the 
commutativity of I with expressed by the set of equations (5.104) and 
(5.105), can be written in the compact form 

fc) = 0, 2^,£,, = D^i. (5.176) 

These equations can be interpreted as the conditions for the existence of a 
second-order Killing tensor^^ for the metric pij. The geometric interpretation 
of Killing tensor is not so simple as before (in particular they cannot generate 
simple point transformations,^^ but must be treated as generalized transfor- 
mations, like those introduced in Sect. 1.6), but, as before, (5.176) can also 
be solved at a specified value of the energy, giving rise to a configurational 
quadratic invariant. 

We shall not dwell on this question. However, it gives us the opportunity 
to stress that research activity in diflFerential and Riemannian geometry can 
still be very fruitfully applied in classical mechanics. 



^^See, e.g., R. Wald: General Relativity (University of Chicago Press, 1984), p.440. 
^®K. Rosquist: Killing tensor conservation laws and their generators, J. Math. Phys. 
30 , 2319-2321 (1989). 
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Notation 

Throughout this book the summation convention on repeated indices has 
been adopted; the summation symbol ^ is used only when confusion is 
possible. At the same time, very often we do not distinguish between co- 
variant and contravariant components of vectors and tensors, the question 
being irrelevant in our treatment. The Bachmann-Landau convention (o,0) 
is used to express the notion of “order of’ (see, for instance, E. T. Whittaker, 
G.N. Watson: A Course of Modern Analysis (Cambridge University Press, 
1927), p. 11.). 
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A.l Spherical Trigonometry 

Some simple relations of spherical trigonometry are useful: given a spherical 
triangle with vertex angles A, B and C and sides given by segments of arc 
a, h and c respectively (see Fig. A.l), the relations 

cos a — cos b cos c + sin 6 sin c cos A, 
cos B sin a = cos b sin c — sin b cos c cos A, 
sin A sin B sin C 
sin a sin b sin c 

hold. From (A.l), exchanging the roles of a and 6, we have 
cos b = cos a cos c + sin a sin c cos 5, 
and, further, exchanging a by c and making some calculations, we have 

sin c cos B = cos b sin a. 

One of the equalities contained in relation (A. 3) can be written in the form 

sin c sin 5 = sin b sin C. 

Dividing by this relation the last obtained we can write 

cot B sin C = cot b sin a, (A. 4) 

a formula of use in the application of the theory of orbits in spherical poten- 
tials. 



(A.l) 

(A.2) 

(A.3) 



A 




Fig. A.l 




A. 2 Curvilinear Coordinate Systems 365 



A. 2 Curvilinear Coordinate Systems 

For the study of orbits in potentials in two and three dimensions it is very 
useful to be able to express the equations of motion in arbitrary coordinate 
systems, so as to exploit at their best the methods for solving dynamical 
problems presented in Chaps. 1 and 5. There are several curvilinear coordi- 
nate systems in which the Hamilton- Jacobi equation is separable with po- 
tentials of real interest: we present in what follows the essential properties of 
curvilinear orthogonal coordinates. 

Three scalar functions 

q" = r{x,y,z), i = l,2,3 (A.5) 

of the Cartesian coordinates {x, z} define, if the parameters are varied, a 

curvilinear coordinate system. The functions /* are assumed to be invertible, 
so that it is possible to express the Cartesian coordinates in terms of the 
curvilinear coordinates: x = y = y{q^) and z = z(q^). The fact that 

the correspondence between the coordinates of a point in the two systems is 
one-to-one is warranted by the condition that the Jacobian determinant^ 
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dqi 
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does not vanish. The squared line element in a general coordinate system is 

3 

ds'^ = ^ gijdq^dq> , 

where the elements of the metric tensor gij are given by 

dx dx dy dy dz dz 

dq^ dq^ dq^ dq^ dq^ dq^ 

In the following we present the properties of those coordinate systems where 
the metric tensor is diagonal: gij = G‘fSij. In this case the coordinate frame 
is called orthogonal. Orthogonal coordinates (besides Cartesian coordinates) 
include cylindrical coordinates: 

^ In formulae where curvilinear coordinates appear there is a slight notational in- 
consistency, since the coordinates are indicated by q^ (with upper indices) for 
generic i, whereas they appear with lower indices when this is one of the values 
1,2,3. The notation with the upper index is preferable for various reasons (it is 
of ease in sums such as that in (A. 6) and for consistency with the usual notation 
of differential geometry) so that we try to use it consistently. To avoid confusion 
with exponential powers, however, we have kept the lower indices when they take 
specific values. Finally, the summation symbol is used, since, in formulae like 
(A. 10), no sum is intended. 



(A.6) 

(A.7) 
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= tu = \Jx^ 2/2, q2 = ^ = arctan ^3 = 2;; 

X 

ds^ — dzu^ “h vj^ d^p^ -j- dz^] G\ — 1, G 2 — '^<7, G"^ = 1, 



(A.8a) 



and the spherical ones 

qi = r = \Zx^ + 2/^ + Q2 = d = arccos {z/r) , Qs = p = arctan(^/x); 
ds^ = -h sin^ ddp^; Gi = 1, G2 — r, G3 = r sin-i^. 

(A.8b) 

One more important aspect of the use of curvilinear coordinates is that of the 
relations among the components of the velocities and the momenta and the 
various expressions of the equations of the motion. The metric introduced in 
(A. 6) allows us to define in a natural way (see Sect. 1.4) the quadratic form 
of the kinetic energy 

^ ^ E GpiPiY, (A.9) 



where we have put the mass equal to unity. Associated with the coordinate 
components of the velocity there are components v'^ = Giq^ in the or- 
thonormal basis consisting of the vectors 



= G-' 



dq^' 



(A.IO) 



and the components of the momentum^ 



Pi = Gfc, 



(A.ll) 



introduced by means of a Legendre transformation with T in the role (see 
(l.C.l) of Lagrangian. In cylindrical and spherical coordinates, using the 
metric elements appearing in (A. 8), we have respectively 

= w,p^ = wp, Pz = z (A. 12a) 



and 



Pr = r, p^ = p^ = sin^ dp. 



(A.12b) 



The equations of motion of a unit mass particle in a potential field ^ can 
then be written in the general form 



j,k j 



dq^ 



0, 



(A.13) 



where 



^ Even in this case the lower index is consistent with the standard notation of dif- 
ferential geometry: in the old language of tensorial algebra pi are the components 
of a covariant vector, whereas are the components of a controvariant vector. 
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is the metric inverse of the one in (A. 6) and the quantities jk^ defined by^ 



dq^ dq^ J 



are the coefficients of the connection given by the natural basis associated 
with curvilinear coordinates. For an orthogonal metric it is easy to verify that 



P jk — -^33 

pi 



1 

^ --G~^ ^ 



2 * dq^ ’ 
dqi ’ 



(A.15) 



r\i = G 



where i ^ j ^ k and, as always in this section, there is no sum over repeated 
indices. For example, in cylindrical coordinates the (A.15) become 



P^ 



22 — — tX7, 



7-i 2 p2 

^ 12 = J- 21 — 



W 



all the others zero, (A. 16) 



and (A. 13) assume the two equivalent forms 



and 



W - W(p + 7— = 0, 

OVD 

tb . I 

(^ + 2 — (p -\ 2^ ~~ 

VD VD^ OLp 

.. ^ 



ZU OVD 

1 

H H t: — — 0, 

VD 



^ VD dip 



'^z + ^ — O5 
dz 



(A.17a) 



(A.17b) 



where the transformation of the components of the velocities v'^ = Giq^ has 
been used, giving 



v^=w, v^ = wp, Vz^z. (A. 18) 

Analogously in the case of spherical coordinates, the coefficients of the con- 
nection are 



^ For a detailed discussion of the foundations of geometry in curved spaces we refer 
to D.Laugwitz, Differential and Riemannian Geometry (Academic Press, 1965). 
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r\o = -T 



r^33 = — r sin^ t?, 



p2 p2 p3 7 -i3 

^ 12—^ 21—^ 13=^ 31 = “, 



p3 p3 

^ 23 — ^ 32 — 



r^33 = — sin ^ COS'!?, 

cos'd 



(A.19) 



sin'?? ’ 



all the others zero, 



and the equations of motion are 



r — rd^ — r sin^ d(p‘^ + ^ = 0, 
or 

“ r ' X 

d + 2-d — sindcosdip^ + ^7777 = 0 , 
r od 

r • 1 

(p + 2-ip + 2cot'dd(p+ , ^ — =0, 

r sm d dip 



(A.20a) 



or, equivalently, 



Vr — 



vl+vl 

r dr 



-=0, 



1/ n 2 n ^ 

Vo + -(VrVO - COtdv ) + - — = 0, 
r ^ r av 

1 1 

+ -{VrV^ + coidv^v^n) H ^ r— = 0, 



(A.20b) 



where the transformation of the components of the velocities is given by 

Vr — r, = rd, v^ = rsindip. (A. 21) 

In the general case, (A. 13) and (A. 15) give 






dq^ 

i=i ^ 



dq^ 



0. (A.22) 



Wanting to express this general equation of motion in terms of the veloci- 
ties v'^ or the momenta, we need the components of the connection in the 
orthonormal basis (A. 10), that is, the quantities 






dq^ / 



(A.23) 



where the basis forms d>\ duals of the orthonormal basis (A. 10), are given 
by the relations 

= Gidq\ = 6^j. (A.24) 

Substituting (A. 10) and (A.24) into (A.23) and making use of the definition 

3^ \ 

dq^ dq^/^ 



r^jk ^{dq\ 
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which is the analogue of (A. 23) in the coordinate basis and generates (A. 14), 
we can explicitly determine F. With the agreement, in the following, that 
the indices are such that i ^ j and that F,i represents the partial derivative 
of the generic function F with respect to the coordinate q\ we have 



F^ 



33 



^GAdq\- 



-1 9 

j 

G 



Gidq\G^ dq^ 



-1 d 






G] 

d ^_2 



^ \ — r' r® 



G~, 



Through similar calculations we find all the components of the connection in 
the orthonormal basis: 



-pi P* 

3 ji — 1 ji^ 



Exploiting coordinate expressions (A. 15), these can be written as 

pGj ^ -G~^Gj^Gj^i, f^ji ^ G^^Gj^Gij, all others zero. (A. 25) 

The general form of the equations of motion in the orthonormal frame is 
therefore 

e + G7^GJ^ {G^JV^V^ - G.Av^f) + G-i ~=0. (A.26) 



Using the relation pi = G‘fq^ = Giv^ and exploiting the equations 



3 



dt 



the equations of motion in terms of the momenta are 

3 



1 d 



1 = 1 



dq^ 



(A.27) 



It is worth remarking that these equations, once one has introduced a Hamil- 
ton function H = T F ^ and remembering (A. 9), are the expression of the 
canonical equations (1.C.3) for the momenta in a general curvilinear coordi- 
nate system. 
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A. 3 Riemannian Geometry 



One more reason for introducing the kinematics along the lines sketched in 
the previous section is that, without any change in the notation, they allow 
us to treat the general case of motion in a non-Euclidean space. We shall 
restrict ourselves here to the case of Riemannian space, that is, space in 
which the metric element is determined by Qah, which is a positive definite 
tensor. Our interest in Riemannian geometry is justified by the fact that 
we have encountered several “geometric” ways of formulating the laws of 
mechanics (such as Jacobi’s approach, described in Sect. 1.8), so that in many 
circumstances a complete geometric treatment of orbit theory can be useful. 

The principal aspect to emphasize is that the metric element given by 
(A. 6) describes, in general, the properties of non-Euclidean spaces in which 
the distance between infinitesimally close points is a quadratic form in the 
increments of the coordinates that refiects intrinsic geometric properties, 
rather then a simple change of coordinates in fiat space. It is therefore obvious 
that it is essential to be able to characterize in an invariant way (that is, in a 
way independent of the chosen coordinates) the intrinsic curvature properties 
of the space: we will devote the final part of this section to this aim. For the 
moment, we want to consider kinematics in curved space. 

The equations of motion in a generic curved space are essentially the same 
as (A. 13), which we write here again, but without the potential term to better 
examine only the geometric aspects of the motion:'^ 



^ , ra 

ds"^ ds ds 



(A.28) 



The only differences with respect to the formulae of the previous section are 
that now the independent variable is no longer the time but an arbitrary 
affine parameter along the curve and that the convention is adopted of the 
sum over repeated indices. Equations (A.28) are usually called the equations 
of the geodesics^ since their solutions give the curves of extremal length in 
the geometry given by the metric element (A. 6). If the space is fiat (and the 
potential is zero) these solutions are obviously straight lines. If one defines 
the velocity vector u" = dq^jds^ then (A.28) can be written in the form 



du^ 

ds 



+ r^^bcu’^u’^ = 0 . 



(A.29) 



Writing the differentiation with respect to s as the total derivative 



_d _ ^ d 

ds ds dq^ ^ dq^' 



^ Actually, the Jacobi geometry just referred to has the effect of incorporating 
the potential into the metric tensor, so that the T’s convey dynamics into the 
geometry of a suitable space. 
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we can interpret the left-hand side of (A. 29) as the action of a differential 
operator on a generic vector field X^, 



DX^ 

ds 



ds 



dX^ 

dq^ 



+ r^hcX^ 



(A.30) 



If the result of this action vanishes (that is, if DX^ jds — 0), the vector X^ is 
said to be parallely transported along the integral curves of . The operation 
in the brackets in (A.30) can be performed in general without reference to 
a curve: it is called the covariant derivative and is denoted for the sake of 
brevity as follows 

= + (A.31) 

It can be applied to tensors of arbitrary rank and to covectors defined by the 
general duality relation 

Xb = gabX^, (A.32) 

the generalization of (A. 11) in the previous section. As an example which en- 
ables us to present a quite generic situation, consider the covariant derivative 
of a mixed second-rank tensor it is written 

A\-,c = - r\cA^d + r^dcA\. (A.33) 



In the metric tensor gab all the information about the intrinsic geometrical 
structure of the space considered is contained, so that, for example, the ex- 
plicit expression of the connection coefficients is given by (A. 14). Since all 
the formulae involving geometric objects (vectors, tensors, curves, etc.) and 
algebraic and differential operations with them, are the same as are found 
working with generic coordinates in Euclidean space, one needs an invariant 
way to discriminate between the two situations. In other words, we need some 
tool that enables us to know if the metric (and its connection) we are using, 
represents a genuine non-Euclidean (or, as it is usually said, “curved”) space 
or whether it is simply related to some, maybe very complicated, change of 
coordinates in flat space. This tool is the Riemann curvature tensor^ defined 
as 

rya /-la 7-ia 1 7-ia r-ifc 7-ia rik /a 

^ bed — A bd,c-J- bc,d + d kc-L bd — J- kdJ- bc‘ (A. 34) 

This tensor enjoys the fundamental property of being identically zero if and 
only if the space is flat: since this property does not depend on the coor- 
dinates used, the Riemann tensor is the invariant intrinsic representative of 
the geometric properties of the space in which motion happens. Several other 
geometric objects can be constructed starting from the Riemann tensor; an 
important quantity is the Gaussian curvature in the plane identified by two 
unit vectors n® and m“: 



C(n, m) = Rabcd^''rn^n''n/ . (A. 35) 

There are then two tensors (actually a tensor and a scalar) that play an 
important role in Riemannian geometry: 
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Rab = R"" acb == g'^'^Rcadb (A. 36) 

and 

R = g^^^Rab = g^^g^^Rcadb. (A.37) 

These are respectively the Ricci tensor and the Ricci curvature scalar. There 
are simple but important relations between these tensors and the Gaussian 
curvatures: taking as unit vectors the orthonormal basis vectors (where 
the lower index labels which vector it is) to compute the quantities in (A. 35), 
we have the Gaussian sectional curvatures 

Cij = Rabcde‘!e)e1ej. (A.38) 

It is quite easy to verify that the sum of the sectional curvatures in the 
direction of is given by 



J2CiJ = Rabml 

3 

SO that it is related to the Ricci tensor, whereas the sum of all the curvatures 
is related to the Ricci scalar by 






An application of the Riemann tensor which is of particular interest in 
the study of dynamical systems is to the analysis of the relative behaviour 
of geodesics, namely their stability properties and their long time behaviour. 
Let us assume that we are given a solution of equation (A. 29), that is, a 
geodesic taken as a “reference” and let be a vector representing a small 
displacement with respect to the reference geodesic. If is the tangent vector 
of the reference, the evolution of the perturbation is governed by the equation 



ds‘^ 



+ R^bcdu^n^u^ = 0 . 



(A.39) 



This fundamental equation is called the equation of the geodesic deviation 
or Jacobi-Levi-Civita equation^ and provides all the information necessary 
to determine the influence of slight changes in the initial conditions on the 
geodesic flow. 

One flnal argument that is worth examining for its applicability in analyt- 
ical dynamics is the question of the symmetries of the metric. Let us consider 
the transformation given by a one-parameter group of motions generated by 
a vector field and ask for the conditions such that the metric tensor is 
invariant under such a transformation. It turns out that these conditions are 



gab,cf, T dacf, ,b T dchf, ,a — O 5 

^ See T. Levi-Civita: Sur I’ecart geodesique. Math. Ann. 97 , 291-320 (1927). 
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which, looking at the relations between the connection coefficients and the 
derivatives of the metric tensor (A. 14), can be written in the equivalent forms 

^a,b + ^b,a - r^'ab^c = 0 , 

or 

2C(a;6) = ^a-,b + 6;a = 0. (A. 40) 

These last two equations are known as Killing equations in Riemannian geom- 
etry and the vector as a Killing vector. Its fundamental importance rests 
on the fact that its existence guarantees the existence of conserved quanti- 
ties along the geodesics. Let us consider the scalar quantity J = and 
compute its derivative along the geodesic whose tangent vector is itself: 

^ = (a;bU'^u!’ + £,aU°‘-bu'’ = 0. (A.41) 

We find that the derivative vanishes, since the first term is zero by Killing 
equation (A. 40) and the second is also zero from the equation of the geodesics. 
J is therefore conserved along the motion. A straightforward generalization of 
this circumstance^ is to conserved quantities of higher order in the velocities: 
suppose that we want to find the condition such that the scalar 

/ = AabU^-y!^ 

is conserved along the geodesics. A calculation in the same spirit as that in 
(A. 41) shows that I is conserved if and only if the equations 

A(ab-,c) = 0 (A.42) 

are satisfied. Generalizing the above definition, the symmetric tensor Aab^ 
which satisfies the condition that its symmetrized covariant derivatives van- 
ish, can be called a Killing tensor. 



See, for instance, L. P. Eisenhart: Riemannian Geometry (Princeton University 
Press, 1950), Sect. 39. 
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The purpose of these notes is twofold. On the one hand, they are intended 
to provide the reader with a list of the sources relevant to each section and 
of the books or papers which are deemed fundamental for deepening one’s 
knowledge of the topic under consideration: on the other hand, occasionally 
the discussion is broadened by trying to locate the topic in a wider and 
clearer context, while having in mind the aim of stimulating a non-passive 
understanding. 

In these notes, any works already mentioned in the text are not repeated, 
except in the case of further specific discussion. 

Introduction 

As far as we know, there exists no study on the evolution of mental schemes or 
models (we do not know how better to describe them) and on their reappear- 
ance from time to time in the history of scientific thought, borrowing from the 
contingent cultural context the forms by which they can be expressed. One 
case we have tried to emphasize, and which is well known to the community 
of experts in celestial mechanics, is that of epicycles and Fourier series, but 
certainly others exist-less macroscopic cases-which, from Ptolemaic theory 
to modern celestial mechanics, have continued equally unchanged in their 
inner nature of mental schemes. 

To deepen the “techniques” used in ancient times for the study of the 
kinematics of celestial bodies, a textbook that is still useful is that mon- 
umental one by P. Duhem: Le Systeme du Monde - Histoire des Doctrines 
Cosmologiques de Platon d Copernic (Hermann, Paris, 1913). Another in- 
teresting textbook is G. V. Schiaparelli: Scritti sulla Storia delVAstronomia 
Antica (Zanichelli, Bologna, 1927; available only in Italian). Also important, 
in order to follow the evolution of scientific thought in the pre-Copernican 
era and the history of ancient astronomy, is J. L. E. Dreyer: History of the 
Planetary Systems from Thales to Kepler (Cambridge University Press, 1906; 
reprinted by Dover). 

The studies on ancient astronomy by Neugebauer deserve a separate men- 
tion. Except for O. Neugebauer: The Exact Sciences in Antiquity (Princeton 
University Press, 1952), a work suitable for a reader of average scientific 
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education, they are texts addressed essentially to specialists. We confine our- 
selves to mentioning O. Neugebauer: A History of Ancient Mathematical As- 
tronomy^ Parts 1-3 (Springer, 1975). 

For the Copernican revolution, a well-known text is T. S. Kuhn: The 
Copernican Revolution. Planetary Astronomy in the Development of West- 
ern Thought (Harvard University Press, Cambridge, 1957). The period from 
ancient times up to Kepler is dealt with by the above-mentioned history of 
astronomy. For readers who can understand Italian, a small book which is a 
mine of information as well as being very useful is R. Marcolongo: II Problema 
dei Tre Corpi (Hoepli, Milan, 1914). 

As far as Newton’s works are concerned, besides Cajori’s edition, we note 
Isaac Newton’s Philosophiae Naturalis Principia Mathematica. The third edi- 
tion (1726) with variant readings^ ed. by A. Koyre and I. B. Cohen (Cam- 
bridge University Press, 1972). Moreover, since it would be foolish to pre- 
tend to list a bibliography, however summary, of the studies on Newto- 
nian thought, we only mention R. S. Westfall: Never at Rest. A Biography 
of I. Newton (Cambridge University Press, 1980) and A. Koyre: Newtonian 
Studies (Chapman & Hall, London, 1965), where the reader can find the 
desired references. 

Obviously, our interest is centred on the evolution of theories related to 
the motion of the celestial bodies; therefore our indications ought always to be 
aimed at this goal, rather than to acquiring a broad knowledge of the history 
of astronomy or mechanics. Hence it is perhaps more appropriate with regard 
to modern authors to refer directly to their works. For instance, in the case of 
Poincare it is convenient to approach his masterpiece itself. Here we mention 
the English translation, H. Poincare: New Methods in Celestial Mechanics 
(Hilger, 1990). Furthermore, the following book is helpful, R. Dugas: Histoire 
de la Mecanique (Editions du Griffon, Neuchatel, 1955; reprinted by Dover 
as A History of Mechanics^ 1988). 

In order not to repeat ourselves, we refer to the relevant chapters in our 
Volume 2 the bibliography on perturbation theory and related problems, 
and on chaotic motions. On this last topic, we suggest the agile booklet of 
D. Ruelle, which affords a pleasant and intelligent reading: Chance & Chaos 
(Princeton University Press, 1991); this represents one of the rare examples of 
educated imparting of knowledge. We also suggest the paper by V. Szebehely: 
Is Celestial Mechanics deterministic?, in Applications of Modern Dynamics to 
Celestial Mechanics and Astrodynamics, ed. by V. Szebehely (Reidel, 1982). 

Chapter 1: Dynamics and Dynamical Systems - Quod Satis 
A, Newtonian Dynamics 

Sects. 1.1, 1.2: An ever-growing number of books about dynamical systems 
is being produced. Since we are interested in a very narrow area, we confine 
ourselves to the following. 
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J. Guckenheimer, P. Holmes: Nonlinear Oscillations, Dynamical Systems and 
Bifurcations of Vector Fields (Springer, 1983). 

A. J. Lichtenberg, M. A. Lieberman: Regular and Stochastic Motion (Springer, 

1983); 2nd edn: Regular and Chaotic Dynamics (Springer, 1992). 

F. Verhulst: Nonlinear Differential Equations and Dynamical Systems (Sprin- 
ger, 1990). 

P. Hagedorn: Nonlinear Oscillations, 2nd edn (Clarendon Press, Oxford, 
1988). 

M. Tabor: Chaos and Integrability in Nonlinear Dynamics (Wiley, 1989). 

Also, the “classic” textbooks on differential equations, such as 

E. A. Coddington, N. Levinson: Theory of Ordinary Differential Equations 
(McGraw-Hill, New York, 1955). 

J. K. Hale: Ordinary differential equations (Wiley-Interscience, New York, 
1969). 

L. Cesari: Asymptotic Behaviour and Stability Problems in Ordinary Differ- 

ential Equations (Springer, 1971). 

M. W. Hirsch, S. Smale: Differential Equations, Dynamical Systems and Lin- 
ear Algebra (Academic Press, 1974). 

Still interesting today is Liapunov’s book: Probleme Ceneral de la Sta- 
bilite du Mouvement (Princeton University Press, 1947; a reproduction of the 
French translation, 1907, of the Russian original, 1892). 

Sect. 1.3: The problem of the linear oscillator is dealt with more or less 
broadly in all the treatises on mechanics. Our treatment is taken from 
V. I. Arnold: Mathematical Methods of Classical Mechanics, 2nd edn (Sprin- 
ger, 1989). Also interesting is the paper by K.R. Meyer: The geometry of 
harmonic oscillators, Amer. Math. Monthly 97, 457-465 (1990), reported af- 
terwards in the book K. R. Meyer, G. R. Hall: Introduction to Hamiltonian 
Dynamical Systems and the N-Body Problem (Springer, 1992). 

B. Lagrangian Dynamics 

Sects. 1.4, 1.5: In order to remain in the frame of the “classics”, besides 
the books of Whittaker and Synge already mentioned, a book we like more 
than any other, which emphasizes the “geometric” approach to mechanics is 

C. Lanezos: The Variational Principles of Mechanics, 4th edn (Toronto Uni- 
versity Press, 1970; reprinted by Dover, 1986). We add L. A. Pars: A Treatise 
on Analytical Dynamics (Heinemann, London, 1964). 

Sects. 1.6, 1.7: The theorem stated in 1918 by Emmy Noether is doubt- 
less one of the most important achievements of mathematical physics in this 
century. Even today, perhaps, it is difficult to fully evaluate its implications 
in the various fields of applied mathematics. We have chosen the version 
which we consider most suitable for application to discrete systems, which 
are implied in celestial mechanics and stellar dynamics. The point of view 
at the heart of this formulation was proposed for the first time by H. Rund 
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and D. Logan (see below). Lack of space prevents us from mentioning the 
enormous number of papers which have been devoted to Noether’s theorem 
from 1918 onward. We shall confine ourselves solely to those which have been 
brought into consideration for our exposition: 

E. Bessel-Hagen: Uber die erhaltungssatze der Elektrodynamik, Math. Ann. 
84, 258-276 (1921). 

E. L. Hill: Hamilton’s principle and the conservation theorems of mathemat- 
ical physics, Rev. of Mod. Phys. 23, 253-260 (1951). 

A. Trautman: Noether equations and conservation laws, Commun. Math. 
Phys. 6, 248-261 (1967). 

H. Rund: A direct approach to Noether’s theorem in the calculus of variations, 
Utilitas Mathematica 2, 205-214 (1972). 

J. D. Logan: On some invariance identities of H. Rund, Utilitas Mathematica 
7, 281-286 (1975). 

Dj.S.Djukic: A procedure for finding first integrals of mechanical systems 
with gauge- variant Lagrangians, Int. J. Non-linear Mech. 8, 479-488 
(1973). 

Dj.S.Djukic, B. D. Vujanovic: Noether’s theory in classical nonconservative 
mechanics, Acta Mechanica 23, 17-27 (1975). 

J. D. Logan: Invariant Variational Principles (Academic Press, New York, 
1977). 

J. A. Kobussen: On a systematic search for integrals of the motion, Helvetica 
Physica Acta 53, 183-200 (1980). 

Sect. 1.8: The principle of least action in Jacobi form is dealt with in various 
textbooks on mechanics; among those already mentioned, we recall the books 
of Lanczos and Pars and, in addition, L. Brillouin: Les Tenseurs en Mecanique 
et en Elasticite (Masson et Cie, Paris, 1945). 

Today, the range of applications of “Jacobi geometry” has been consider- 
ably enlarged. We mention a few papers 

V. G. Gurzadyan, G. K. Savvidy: Collective relaxation of stellar systems, As- 
tron. Astrophys. 160, 203-213 (1986). 

D. Boccaletti, G. Pucacco, R. Ruffini: Multiple relaxation time-scales in stellar 
dynamics, Astron. Astrophys. 244, 48-51 (1991). 

P. Cipriani, G. Pucacco: Jacobi geometry and chaos in A'-body systems, in 
Proceedings of the Workshop “From Newton to Chaos’\ ed. by A. Roy 
(NATO ASl, Cortina, 1993). 

M. Pettini: Geometrical hints for a nonperturbative approach to Hamiltonian 
dynamics, Phys. Rev. F 47, 828-850 (1993). 

M. Szydlowski: Curvature of gravitationally bound mechanical Systems, J. 
Math. Phys. 35, 1850-1880 (1994). 

S.Bazanski, P. Jaranowski: The inverse Jacobi problem, J. Phys. A: Math. 
Gen. 27, 3221-3234 (1994). 
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C. Hamiltonian Dynamics and Hamilton- Jacobi Theory 

Sects. 1.9—1.15: The relevant textbooks have been mentioned in the foot- 
notes. In addition, we mention a stimulating and, at the same time, simple 
approach to a modern view of classical mechanics, I. Percival, D. Richards: 
Introduction to Dynamics (Cambridge University Press, 1982), and an inter- 
esting opportunity for casting a glance at topics tightly bound to classical 
dynamics, D. Park: Classical dynamics and its quantum analogues, 2nd edn 
(Springer, 1990). 

Sects. 1.16, 1.17: The metod of action-angle variables can be found in al- 
most all the textbooks mentioned so far; nevertheless, we confirm our convic- 
tion that Born’s is the clearest exposition from the point of view of applica- 
tions. Historically, the first systematic treatment is due to K. Schwarzschild in 
his paper Zur Quant umhypothese, Sitzber. Berl. Akad. Wiss. 548-568 (1916), 
where terms action variables and angle variables (in German Wirkungsvari- 
able and Winkelvariable respectively) appeared for the first time. 

Sect. 1.18: A proof of Liouville’s theorem in terms of modern differen- 
tial geometry can be found, for instance, in B. A. Dubrovin, A. T. Fomenko, 
S. P. Novikov: Modern Geometry-Methods and Applications, Vol. II (Springer, 
1991). 

Sect. 1.19: On the Painleve property, see M. D. Kruskal, P. A. Clarkson: The 
Painleve-Kowalevski and Poly-Painleve tests for integrability. Studies in Ap- 
plied Mathematics 86, 87-165 (1992). 

Chapter 2: The T^vo-Body Problem 

Sect. 2.1: The main source of this section is a small but “classic” book 

H. Pollard: Mathematical Introduction to Celestial Mechanics (Prentice-Hall, 
1966; reprinted in Cams Mathematical Monographs, 1976). 

Obviously, with regard to this problem, the reader has only an embarass- 
ment of choice. Besides the book of Danby, already mentioned in the Preface, 
and a classic like that of Moulton, we mention 

D. Brouwer, G. Clemence: Methods of Celestial Mechanics (Academic Press, 

1961). 

H. C. Plummer: An introductory Treatise on Dynamical Astronomy (Cam- 
bridge University Press, 1918). 

A.E.Roy: Orbital Motion, 3rd edn (Adam Hilger, 1988). 

W.M. Smart: Celestial Mechanics (Longmans, Green and Co., 1953). 

L. G. Taff: Celestial Mechanics. A Computational Guide for the Practitioner 
(Wiley, 1985). 




380 Bibliographical Notes 



Sect. 2.2: For the history of the L-R-L vector, see the papers by H. Gold- 
stein: Prehistory of the Runge-Lenz vector, Am. J. Phys. 43, 735-738 (1975), 
and: More on the prehistory of the Laplace-Runge-Lenz vector. Am. J. Phys. 
44, 1123-1124 (1976) and D.Park, op. cit., pp. 73-75. 

The literature on the L-R-L vector and its generalizations is at present 
so plentiful that we cannot even try to give a representative list. The main 
stream of research has concerned the Lie-group approach. In the early times, 
we have 

V. Fock: Zur Theorie des Wasserstoffatoms, Z. Physik 98, 145-154 (1935). 

V. Bargmann: Zur Theorie des Wasserstoffatom, Z. Physik 99, 576-582 
(1936). 

V. Bargmann: Irreducible representations of the Lorentz group. Annals of 
Math. 48, 568-640 (1947). 

And then 

H. Bacry, H. Ruegg: Dynamical groups and spherical potentials in classical 
mechanics, Commun. Math. Phys. 3, 323-333 (1966). 

V. A. Dulock, H. V. McIntosh: On the degeneracy of the Kepler problem, Pac. 
J. Math. 19, 39-55 (1966). 

D. M. Fradkin: Existence of the dynamic symmetries O 4 and SUs for all clas- 
sical central potential problems. Prog. Theor. Phys. 37, 798-812 (1967). 

G. Gyorgyi: Kepler’s equation, Fock variables, Bacry’s generators and Dirac 
Brackets, II Nuovo Cimento A 53, 717-736 (1968). 

G. E. Prince, C. J. Eliezer: On the Lie symmetries of the classical Kepler prob- 
lem, J. Phys. A, Math. Gen. 14, 587-596 (1981). 

V. M. Gorringe, P. G. Leach: The first integrals and their Lie algebra of the 
most general autonomous Hamiltonian of the form H = T-{-V possessing 
a Laplace-Runge-Lenz vector, J. Austral. Math. Soc. Ser. B 34, 511-522 
(1993). 

V. M. Gorringe, P. G. Leach: Kepler’s third law and the oscillator’s isochro- 
nism. Am. J. Phys. 61, 991-995 (1993). 

The last two authors have devoted many interesting papers to the L-R-L 
vector and its generalizations in Kepler-like problems; the reader can find 
the references in the two papers quoted. For the approach using Noether’s 
theorem, we mention the above cited paper by Kobussen. 

Sect. 2.3: Our treatment follows the suggestions contained in Arnold’s book. 
See also 

J. Roels, C. Aerts: Central forces depending on the distance only-case where 
all the bounded orbits are periodic. Celestial Mech. 44, 77-85 (1988). 

E. Onofri, M. Pauri: Search for periodic Hamiltonian flows: A generalized 

Bertrand’s theorem, J. Math. Phys. 19, 1850-1858 (1978). 




Bibliographical Notes 381 



Sect. 2.4: For recent research on Kepler’s equation, see 

T. M. Burkardt, J. M. A. Danby: The solution of the Kepler’s Equation-I, Ce- 
lestial Mech. 31, 95-107 (1983); The solution of the Kepler’s Equation-II, 
ibid., 317-328 (1983). 

P. Colwell: Kepler’s equation and Newton’s method. Celestial Mech. 52, 203- 
204 (1991). 

R. H. Gooding, A. W. Odell: Procedures for solving Kepler’s Equation, Celes- 

tial Mech. 38, 307-334 (1986); The hyperbolic Kepler’s Equation (and 
the elliptic equation revisited), ibid. 44, 267-282 (1988). 

S. Mikkola: A cubic approximation for Kepler’s Equation, Celestial Mech. 40, 

329-334 (1987). 

E. W. Ng: A general algorithm for the solution of Kepler’s Equation for elliptic 
orbits. Celestial Mech. 20, 243-249 (1979). 

A. Nijenhuis: Solving Kepler’s Equation with high efficiency and accuracy. 
Celestial Mech. 51, 319-330 (1991). 

R. A. Serafin: Bounds on the solution to Kepler’s Equation, Celestial Mech. 
38, 111-121 (1986). 

G. R. Smith: A simple, efficient starting value for the iterative solution of 
Kepler’s Equation, Celestial Mech. 19, 163-166 (1979). 

L. G. Taff, T. A. Brennan: Solving Kepler’s Equation, Celestial Mech. 46, 163- 

176 (1989). 

Sect. 2.5: The books mentioned for Sect. 2.1 are good references for this 
section also. 

Sect. 2.6: Additional bibliography on the problem of regularization can be 
found in the footnotes to Sects. 3.4, 4.2, 4.6. Interesting readings for a math- 
ematically oriented reader are represented by: 

R. Easton: Regularization of vector fields by surgery, J. Diff. Eq. 10, 92-99 
(1971). 

J. Milnor: On the geometry of the Kepler problem, Amer. Math. Monthly 90, 
353-365 (1983). 

J. Moser: Regularization of the Kepler’s problem and the averaging method 
on a manifold, Commun. Pure Appl. Math. 23, 609-636 (1970). 

E. A. Belbruno: Two-body motion under the inverse square central force and 
equivalent geodesic fiows. Celestial Mech. 15, 465-476 (1977). 

Yu. Osipov: The Kepler problem and geodesic fiows in spaces of constant 
curvature. Celestial Mech. 16, 191-208 (1977). 

M. D. Vivarelli: On the connection among three classical mechanical problems 
via the hypercomplex KS transformation. Celestial Mech. 50, 109-124 
(1991). 

J. F. Carifiena, C. Lopez, M. A. Del Olmo, M. Santander: Conformal geometry 
of the Kepler orbit space. Celestial Mech. 52, 307-343 (1991). 




382 Bibliographical Notes 



Many papers on the subject of this section can be considered relevant for 
Sect. 2.2 as well. Furthermore, note that many things have been discovered 
and subequently rediscovered; see, for instance, 

V. I. Arnold, V. A. Vasilev: Newton’s Principia read 300 years later. Not. 

Amer. Math. Soc. 36, 1148-1154 (1989). 

L. Mittag, M. J. Stephen: Conformal transformations and the application of 
complex variables in mechanics and quantum mechanics. Am. J. Phys. 
60, 207-211 (1992). 

T. Needhan: Newton and the transmutation of force, Amer. Math. Monthly 
100, 119-137 (1993). 

Sect. 2.7: We have summarized the paper by W. Kaplan: Topology of the 
two-body problem, Amer. Math. Monthly 49, 316-323 (1942). See also 
S.Smale: Topology of mechanics-I, Inventiones Math. 10, 305-331 (1970); 
Topology of mechanics-II, ibid. 11, 45-64 (1971). 



Chapter 3: The AT-Body Problem 

Sects. 3.1— 3.5: As additional reading, we suggest a paper by G.Benettin, 
L. Galgani, A. Giorgilli: On the Poincare’s non-existence theorem, in Advances 
in Nonlinear Dynamics and Stochastic Processes^ ed. by R. Livi, A. Politi 
(World Scientific, 1985), and a review on Painleve’s conjecture by F. N. Diacu: 
The Mathematical Intelligencer 15, 6-12 (1993). 

Sect. 3.7: For this section, the reader can also consult Y. Hagihara: Celestial 
Mechanics^ Vol. I (MIT Press, 1970), Chap. 3. This book is particularly useful 
for its rich bibliography. 

Sect. 3.8: The central configurations of the V-body problem have always 
provoked great interest among mathematicians. We confine ourselves to men- 
tioning only three papers, in which the reader can find further references 

D. G. Saari: On the role and the properties of V-body central configurations. 
Celestial Mech. 21, 9-20 (1980). 

F. Pacella: Central configurations of the V-body problem via equivalent 
Morse theory. Archive for Rat. Mech. and Anal. 97, 59-74 (1987). 

J. Llibre: On the number of central configurations in the V-body problem. 
Celestial Mech. 50, 89-96 (1981). 

As a final reading, we suggest the book by F.N. Diacu: Singularities of 
the N-body problem. An Introduction to Celestial Mechanics (Universite de 
Montreal, Centre de Recherches Mathematiques, Montreal, PQ, 1992). Un- 
fortunately, this beautiful book was not available to us while we were writing 
this chapter. 




Bibliographical Notes 383 



Chapter 4: The Three-Body Problem 

Sects. 4.1, 4.2: As a general reference, the reader can refer to 

C.Marchal: The three-body problem (Elsevier, 1990). 

Y. Hagihara: Celestial Mechanics^ Vol. IV (Japan Society for the Promotion 
of Science, 1975). 

Sect. 4.3: For the restricted problem, the main source is still the magnificent 
book by Szebehely, which we have already made abundant use of. 

Sect. 4.4: To complete the bibliography given in the footnotes, see 

K. G. J. Jacobi: Sur le mouvement d’un point et sur un cas particulier du 
probleme des trois corps, Compte Rendu de TAcademie des Sciences, 
Paris, pp. 59-61 (1836). 

A. M. Leontovich: On the stability of Lagrange’s periodic solutions of the 
restricted three-body problem, Soviet Math. Dokl. 3, 425-429, (1962). 
A.Deprit, A. Deprit Bartholome: Stability of the triangular Lagrangian 
points, Astron. J. 72, 173-179 (1967). 

Sect. 4.5: The source is Szebehely’s book. 

Sect. 4.6: More complete references are in Y. Hagihara: Celestial Mechanics, 
Vol. IV (see above). 

Sect. 4.7: For the motion of the Moon, see also 
A. Cook: The Motion of the Moon (Adam Hilger, 1988). 

Chapter 5: Orbits in Given Potentials 

Sects. 5.1— 5.3: The subject of these sections is so general that the litera- 
ture devoted to it is enormous. Just to complete the references on isolating 
integrals we cite, besides Wintner’s book, 

K. F. Ogorodnikov: Dynamics of Stellar Systems (Pergamon, Oxford, 1965). 
E. Onofri, M. Pauri: Constants of motion and degeneration in Hamiltonian 
Systems, J. Math. Phys. 14, 1106-1115 (1973). 

Sect. 5.4: As to the problem of the separability of the Hamilton- Jacobi equa- 
tion, we also refer to 

P. Stackel: Fine charakteristiche Eigenschaft der Flachen, deren Linienele- 
ment ds durch ds‘^ = {x{Qi) + KQ 2 )) {dqf + dq^) gegeben wird. Math. 
Ann. 35, 91-101 (1890). 

T. Levi-Civita: Sulla integrazione della equazione di Hamilton- Jacobi per 
separazione di variabili. Math. Ann. 66, 383-397 (1904). 

P. Burgatti: Determinazione delle equazioni di Hamilton- Jacobi integrabili 
mediante la separazione delle variabili, Rendiconti Ace. Naz. Lined 20, 
108-111 (1911). 




384 Bibliographical Notes 



F. A. Dall’Acqua: Le equazioni di Hamilton- Jacobi che si integrano per sepa- 
razione di variabili, Rendiconti Circ. Mat. Palermo 33, 341-351 (1912). 

L. P. Eisenhart: Separable Systems of Stackel, Ann. Math. 35, 284-305 (1934). 

Sect. 5.5: The complete generalization of Darboux’s results has been given 
by B. Dorizzi, B. Grammaticos and A. Ramani: A new class of integrables 
systems, J. Math. Phys. 24, 2282-2288 (1983). 

Sect. 5.7: Interesting reading is provided by V. Beletski: Essais sur le Mou- 
vement des Corps Cosmiques, 2nd edn (Editions Mir, Moscou,1986). 

Sect. 5.9: The study of configurational invariants starts with Birkhoff’s 
book, G.Birkhoff: Dynamical Systems (1927; reprinted by the American 
Mathematical Society, Providence, Rhode Island 1966). 

Invariants at fixed energy are very common in field theory since the pio- 
neering work of Dirac: 

P. A. M. Dirac: Homogeneous variables in classical dynamics, Proc. Cambridge 
Phil. Soc. 29, 389-400 (1933). 

P. A. M. Dirac: Generalized Hamiltonian dynamics, Canad. J. Math. 2, 129- 
148 (1950). 

P. A. M. Dirac: Lectures on Quantum Mechanics (Yeshiva University, New 
York, 1964). 

More on the use of Killing tensors can be found in 

P. Sommers: On Killing tensors and constants of motion, J. Math. Phys. 14, 
787-790 (1973). 

E. G.Kalnins, W. Miller, Jr.: Killing tensors and variable separation for 
Hamilton- Jacobi and Helmholtz equations, SIAM J. Math. Anal. 11, 
1011-1026 (1980). 

K.Rosquist, G. Uggla: Killing tensors in two-dimensional space-times with 
applications to cosmology, J. Math. Phys. 32, 3412-3422 (1991). 

K. Rosquist, G. Pucacco: Invariants at fixed and arbitrary energy - A unified 
geometric approach, J. Phys. A: Math. Gen. 28, 3235-3252 (1995). 

S. Benenti: Intrinsic characterization of the variable separation in the 
Hamilton- Jacobi equation, J. Math. Phys. 38, 6578-6602 (1997). 

M. Karlovini, K. Rosquist: A unified treatment of cubic invariants at fixed 
and arbitrary energy, J. Math. Phys. 41, 370-384 (2000). 




Name Index 



Page numbers in italics indicate that the name is referred to in a footnote 
or in the Bibliographic Notes. 



Aerts, C. 380 
Apollonius 2 
Appel, P. 140 
Arenstorf, R. F. 283 
Arnold, V. I. 8, 10, 144, 269, 270, 377, 
380, 382 

Bacry, H. 380 

Bargmann, V. 380 

Bauer, F. 23 

Bazanski, S. 378 

Belbruno, E. A. 381 

Beletski, V. 384 

Benenti, S. 384 

Benettin, G. 382 

Bertrand, J. 131, 141, 141^ 146 

Bessel, F.W. 152, 154 

Bessel-Hagen, E. 188, 378 

Binet A. 134 

Binney, J.J. 306, 307, 307, 316, 320 
Birkhoff, G.D. 11, 270, 270, 283, 283, 
384 

Boccaletti, D. 378 

Born, M. 106, 106, 109, 113, 307, 307, 
320, 379 

Brennan, T. A. 381 
Brillouin, L. 178, 378 
Brouwer, D. 379 
Brown, E. W. 298, 299 
Bruns, E. H. 184, 187, 187 
Burkardt, T. M. 381 
Burgatti, P. 92, 383 

Gajori, F. 5, 376 

Cantrijn, F. 4^^ ^^0 

Carihena, J. F. 381 

Cartan, E. 65, 65, 69, 69, 70, 70, 75 



Gasati, G. 10 

Cauchy, A. L. 155, 165, 168, 180, 182, 
184, 192, 196, 244, 247 
Cavendish, H. 8 
Celletti, A. 286 
Cesari, L. 377 
Chandrasekhar, S. V, 302 
Chang, Y. F. 123 
Chazy, J. 201, 201, 205, 205, 208 
Chebyshev, P. L. 55 
Cipriani, P. 378 
Clairaut, A. C. 6, 338 
Clarkson, P. A. 379 
Clausius, R. 209, 209, 211 
Clemence, G. 379 
Coddington, E. A. 377 
Cohen, LB. 376 
Colwell, P. 381 
Cook, A. 383 
Copernicus, N. 3, 4 
Courant, R. 337, 337 

D’Alembert, J. le R. 6 
Dall’Acqua, F. A. 92, 384 
Damour, T. 178 
Danby, J. M. A. VII, 379, 381 
Dante 9 

Darboux, G. 92, 146, 337, 337, 338, 
384 

Darwin, G. H. 265 
Del Olmo, M. A. 381 
Delaunay, Ch.-E. 161, 162, 171, 272, 
273, 285, 296, 316, 322 
Deprit, A. 270, 383 
Deprit Bartholome, A. 270, 383 
de Vries, G. 117 

de Zeeuw, P. T. 323, 323, 353, 353 




386 Name Index 



Diacu, F. N. 382 
Dirac, P. A. M. 384 
Dirichlet, P. G. L. 266 
Djukic, Dj.S. 378 
Dorizzi, B. 384 
Dreyer, J. L. E. 375 
Dubrovin, B. A. 379 
Dugas, R. 376 
Duhem, P. 375 
Dulock, V. A. 380 

Easton, R. 381 

Eddington, A. S. VII, 209, 340 
Einstein, A. 146 
Eisenhart, L. P. ^5, 92, 373, 384 
Eliezer, C. J. 380 

Euler, L. 6, 63, 211, 216, 224, 229, 257, 
341 

Feshbach, H. 323, 323 
Flaschka, H. 119 
Fock, V. 178, 380 
Fomenko, A. T. 379 
Ford, J. 10 

Fourier, J.B.J. 4, 111, 151 
Fradkin, D. M. 380 

Galilei, G. 191 

Galgani, L. 382 

Gantmacher, F. R. 23 

Gauss, K. F. I 46 

Gerhard, O. E. 316 

Gerver, J. L. 197, 197 

Giorgilli, A. 286, 382 

Goldstein, H. 32, 47, 128, 332, 380 

Gooding, R. H. 381 

Gorringe, V. M. 380 

Grammaticos, B. 384 

Guckenheimer, J. 377 

Gurzadyan, V. G. 378 

Gylden, H. 271 

Gyorgyi, G. 380 

Hadamard, J. 12, 12 

Hagedorn, P. 377 

Hagihara, Y. 283, 283, 382, 383 

Hale, J. K. 377 

Hall, G. R. 270, 271, 377 

Halley, E. 5 

Halmos, P. R. 71 

Halphen, G. H. 146 

Hamilton, W. R. 34, 35, 41, 57, 61, 62, 
65, 73, 87, 90-99, 103, 109-112, 



116, 140, 141, 158, 253, 332, 365, 
369 

Hawking, S. 178 

Heiles, C. VI, 11, 11, 12, 123, 123, 124 
Henon, M. VI, 11, 11, 12, 123, 123, 
124, 316 

Hietarinta, J. 336, 336 
Hilbert, D. 337, 337 
Hill, E. L. 378 

Hill, G. W. 260-264, 264 , 293-299, 293, 
299 

Hipparchus 1, 2, 13 
Hirsch, M. W. 377 
Holmes, P. 377 
Hopf, H. 116 
Hurwitz, A. 289 

Ince, E. L. 123 
Israel, G. 8 
Israel, W. 178 

Jacobi, G.G.J. (K.G.J.) F, 57-60, 57, 
64, 68, 74, 90-97, 103, 109, 110, 
112, 116, 140, 141, 158, 186, 186, 
188, 188, 195, 199, 203, 206, 211, 
215, 233, 242, 243, 253, 256-265, 
257, 282, 284, 287, 291, 295, 299, 

323, 323, 333, 334, 341, 365, 370, 

370, 378, 383 
Jaranowski, P. 378 
Jeans, J. H. 209, 352, 352 
Jeffreys, H. 323, 323, 329, 329 

Kalnins, E. G. 384 
Kaplan, W. 382 
Kapteyn, J. C. 352, 352 
Karlovini, M. 384 

Kepler, J. 4, 93, 98, 128, 129, 133, 134, 
136, 139, 151, 153, 155, 156, 162, 

170, 171, 212, 247, 258, 275, 275, 

277, 323, 341, 380 

Killing, W. 45, 45, 46, 48, 136, 189, 
190, 361, 362, 373, 384 
Kobussen, J. A. 378, 380 
Kolmogorov, A. N. 10, 10 
Korteweg, D. J. 117 
Kowalevski, S. 124, 124 
Koyre, A. 376 
Kronecker, L. 107 
Kruskal, M. D. 379 
Kuhn, T. S. 376 
Kustaanheimo, P. 162, 289, 289 




Name Index 387 



Lagrange, J. L. 6, 32, 34-39, 45, 57, 
80-82, 153, 155, 186, 186 , 192, 199, 
203, 206, 208, 209, 211, 215, 216, 
221, 233, 266, 281, 296 
Lanczos, C. 71, 71 , 377 , 378 
Langevin, P. 302 

Laplace, R S. 6, 7, 7, 8, 9, 13, 98, 132, 
136-139, 155, 171, 200, 229, 229 , 
230, 234 

Laugwitz, D. 367 
Lax, RD. 117, 117 
Leach, R G. 380 
Lebesgue, H. L. 71, 196 
Legendre, A. M. 61, 62, 83, 288, 366 
Lemaitre, G. 283 
Lenz, W. 98, 132, 136-139, 171 
Leontovich, A. M. 383 
Levi-Civita, T. 92, 162, 162 , 164, 166, 
178 , 192, 238, 245, 2 ^ 5 , 248, 250, 

252, 252 , 279, 283, 289, 291, 294, 

372, 372 , 383 

Levinson, N. 377 

Lichtenberg, A. J. 377 
Lie, M.S. 44, 117, 117 , 141, 380 
Lieberman, M. A. 377 
Lindsted, A. 10 

Liouville, J. 69, 115, 115 , 120, 141, 
335, 379 

Lipschitz, R. O. 12, 16 

Lissajous, J. A. 30 

Littlewood, J. E. 269, 269 , 286 

Livi, R. 382 

Llibre, J. 382 

Logan, J. D. 378 

Lopez, C. 381 

Lyapunov, A. M. 24, 25, 377 
Lynden-Bell, D. 323, 323 

MacMillan, W. D. 286 
Marchal, C. 208 , 383 
Marcolongo, R. 376 
Markeev, A. 270 
Markushevich, A. I. 329, 329 
Mather, J. 197, 197 
Maupertuis, P. L. M. 58 
McGehee, R. 194 , 197, 197 
McGill, C. 316 
McIntosh, H. V. 380 
Meyer, K. R. 270 , 271 , 377 
Miller, W. 384 
Milne, E. A. 211, 211 
Milnor, J. 381 
Mikkola, S. 381 



Mittag, L. 382 
Moisseiev, N. 272 
Morse, R M. 323, 323 
Moser, J. 10, 180, 180 , 183 , 270 , 292, 
292 , 381 
Motte, A. 5 

Moulton, F. R. 5, 9, 9 , 155, 155 , 156 , 
224, 224 , 271 , 379 

Napoleon 7 
Needhan, T. 382 
Neugebauer, O. 376 
Newton, L 1, 4, 5, 5 , 6, 6 , 13, 126, 127 , 
155, 177, 376 
Ng, E. W. 381 
Nijenhuis, A. 382 

Noether, A. E. 43-49, 52, 136, 139, 188, 
377 , 378 , 380 
Nohel, J. A. 23 
Novikov, S. P. 379 

Odell, A. W. 381 
Ogorodnikov, K. F. 383 
Onofri, E. 380 , 383 
Oort, J.H. 352, 352 
Osipov, Yu. 381 

Pacella, F. 382 

Painleve, R 123, 187, 187 , 192, 192 , 
194, 194 , 195, 249, 379 
Park, D. 380 

Pars, L.A. 67 , 68 , 341 , 377 , 378 
Pauri, M. 380 , 383 
Pavanini, G. 286 , 289, 289 
Peano, G. 8 
Percival, I. 379 
Pettini, M. 378 
Pierce, D.A. 283 
Plummer, H. C. 379 
Poincare, H. 6, 9, 9 , 10, 11, 13, 13 , 65, 
65 , 69-72, 116, 192, 209 , 255, 257 , 
285, 376 

Poisson, S.D. 73, 74, 81, 191, 335 
Politi, A. 382 

Pollard, H. 196 , 201, 201 , 206, 379 
Prince, G. E. 380 
Ptolemy 2, 3, 13 
Pucacco, G. 378 , 384 

Qiu-Dong, W. 245 

Ramani, A. 384 
Raphson, J. 155 




388 Name Index 



Rasband, S. N. 36 
Ricci, G. 372 
Richards, D. 379 

Riemann, G.F.B. 165, 168, 329, 371 

Roels, J. 380 

Rosquist, K. 362^ 384 

Rouche, E. 155 

Routh, E. J. 38, 40, 130, 271 

Roy, A. E. 379 

Ruelle, D. 376 

Ruegg, H. 380 

Ruffini, R. 378 

Rund, H. 377, 378 

Runge, C. 98, 132, 137, 139, 171, 

380 

Saari, D. G. 4, 195, 196, 196, 201, 

201, 215, 245, 245, 382 
Saha, R 316 
Santander, M. 381 
Sarlet, W. 43, 360 
Saslaw, W. C. 214, 302 
Savvidy, G. K. 378 
Scheifele, G. 163, 166, 291 
Schiaparelli, G.V. 4, 375 
Schutz, B. 25 
Schwarz, H. 181, 199, 203 
Schwarzschild, K. 379 
Schwarzschild, M. 356, 356 
Serafin, R. A. 381 
Siegel, C. L. 180, 180, 183, 270 
Sitnikov, K. 208, 286, 292, 293 
Smale, S. 377, 382 
Smart, W. M. 152, 379 
Smith, G. R. 381 
Sommers, R 384 

Stackel, R 92, 332, 334, 339, 341, 342, 
383 

Stephen, M. J. 382 

Sternberg, S. 298 

Stiefel, E.L. 162, 163, 166, 289, 291 



Sundman, K. F. 162, 198, 198, 199, 
204, 246-249, 252 
Synge, J. L. 42, 59, 71, 86, 377 
Swirles, B. 323, 323, 329, 329 
Szebehely, V. 259, 283, 283, 286, 286, 
294, 376, 383 
Szydlowski, M. 378 

Tabor, M. 123, 359, 377 
Tail, L. G. 379, 381 
Tauber, A. 201 
Taylor, B. 44, 151, 267 
Thiele, T. N. 283 

Tisserand, F. 178, 178, 259, 259, 260 
Toda, M. 117, 117, 119, 119, 120 
Trautman, A. 378 
Tremaine, S. D. 306, 307, 307, 320 

Uggla, C. 384 

Vasilev, V. A. 382 
Verhulst, F. 377 
Vivarelli, M. D. 381 
Von Neumann, J. 289, 302 
Von Zeipel, H. 194, 194^ 195 
Vujanovic, B. D. 378 

Wald, R. 362 

Watson, G. N. 323, 323, 348, 363 
Weierstrass, K. 198, 198 
Weiss, J. 123 
Westfall, R. S. 376 

Whittaker, E.T. 35, 77, 187, 188, 192, 
192, 221, 323, 323, 348, 363, 377 
Widder, D.V. 201 
Wigner, E. P. 289 

Wintner, A. 187, 187, 196, 201, 201, 
216, 216, 217, 217, 230, 238, 238, 
383 

Wodnar, K. 293, 293 



Xia, Z. 197, 197 




Subject Index 



Action 34, 57, 91 
reduced (Maupertuis) 58, 92 
Action-angle variables 98ff 
Action integral 34 
Almagest 3 
Anomaly 
eccentric 147, 318 
(in regularization) 163 
mean 149, 311 
true 133, 312 
Apocentre 157, 315 
Apogee 2 
Apse line 133 
Apsidal angle 131, 143ff 
Attractor 17 

Barker’s tables 148 
Bertrand’s theorem 141ff 
Binet’s method 134 
Birkhoff normal form 270 
Bruns’s theorem 187 

Canonical equations 61 
Canonical transformation 76ff 
Canonical variables 61 
Central configurations 229ff 
Central force 126, 307ff 
Centre 23 

Centre of mass motion 

in the two-body problem 126 
in the A"-body problem 185 
Centrifugal term 130 
Chaos 1, 11, 376 
Collinear solutions 218 

in the three-body problem 218ff 
Collisionless dynamics 303 
Collisions 

in the A"-body problem 192 
Configuration space 32, 43, 65 
Conformal transformations 329 
in the Kepler problem 164ff 



in the restricted problem 279ff 
Conservation of angular momentum 
in the two-body problem 129, 141, 
171 

in the A/'-body problem 190 
in spherically symmetric potentials 
310ff 

Conservative systems 20 
Copernican revolution 4 
Cotangent space 65 
Critical points 17, 23-25 
asymptotically stable 24, 25 
improperly stable 23 
properly stable 23 
saddle 23 
unstable 23 

Deferent 2 
Degeneracy 112 

in the two-body problem 160 
in spherically symmetric potentials 
310 

Degrees of freedom 32 
Delaunay elements 

in the two-body problem 158ff, 

316 

in the restricted problem 272ff 
Determinism 8 
Diffeomorphism 26 
Differential manifold 25 
Dione 271 

Dirichlet cells (principle of) 28 
Dynamical friction 302 
Dynamical systems 16, 376 
autonomous 16 

Dynamical (or crossing) time 303 

Earth potential 349 
Eccentric 1-2 
Eccentricity 132-133 




390 Subject Index 



“Effacement” of the internal structure 
178 

of the point 36, 43 
Elements of the orbit 157ff 
Elimination of nodes 188 
Elimination of the time 75 
Ellipsoidal coordinates 325 
Elliptical coordinates 328 
Epicycles 1 
Equant 3 

Equilateral solution 241 
Equilibrium solutions 17, 26 

in the restricted problem 260-263 
stability 265ff 
Existence theorem 16 

for the A/'-body problem 179ff 
“External” problem 178 

Fictitious time 164 
Final motions 

in the A/^-body problem 207-208 
First integral see Integral of motion 
Flat solutions 217 
Focal ellipse 326 
Focal hyperbola 326 

Galilei group 191 
Gauge terms 44 
Gauss ring 146 

Gaussian sectional curvature 372 
Gegenschein phenomenon 271 
Geodesic 59 

Geodesic deviation (Jacobi-Levi- 
Civita) equation 372 
Geodesic equation 370 
Gyroscopic terms 40 

Hamilton’s principle 34, 57, 62 
Hamiltonian 41 
Harmonic oscillator 19 
anisotropic 53, 97, 335 
isotropic 50, 96 
A/^-dimensional 26, 50, 53, 95 
planar 26, 96 
Hidden symmetry 52 
Hill 

curves 264-265 
equation 299 
lunar theory 296-299 
problem 293-296 
Holonomic systems 32 
Homeomorphism 171 
Homographic solutions 219 



collinear 222 
flat 224 
spatial 227 

Homothetic solutions 220 

Ignorable variables 38 
Inclination 157 
Integrability 117, 124 
Integral of motion 18-20, 46, 304, 315 
configurational 360-361 
isolating 188, 305, 315 
Integral invariants 66 
absolute 67 
Cartan 70 
Poincare 69 
relative 67 
Integral manifold 18 
“Internal” problem 178 
Invariant see Integral of motion 
Invariant plane 200, 218 
Involution 113 
Isochronal potential 316ff 
Isospectral deformation 118 

Jacobi 
constant 258 

coordinate system 242-244 
geometry 378 
identity 73 
integral 64, 258 
multipliers 68 
stability criterion 208 
Jupiter 2, 200, 259, 271, 283-286 

KAM theorem 10 
Kepler equation 151ff, 323 
Kepler laws 4 
first law 133 
problem 136, 158 
second law 129 
third law 134 

Killing equation 45-48, 50, 361, 373 
Killing tensor 362, 373 
Killing vector 362, 373 
K-S matrix 162, 289-291 

Lagrange-Dirichlet theorem 266 
Lagrange- Jacobi identity 186 
Lagrangian 34 
reduced 38 
Lagrangian solutions 

three-body problem 239ff 




Subject Index 391 



Laplace-Runge-Lenz vector 132, 

136ff 

Laplace theorem 229ff 
Least-action principle see Hamilton’s 
principle 

Jacobi form of 57 
Legendre transformation 61, 83 
Levi-Civita transformation 164ff, 279, 
282 

Libration 111, 355 
Lie’s theorem 117 

in the two-body problem 141 
in the AT-body problem 192 
Line element 33 
action 59 
kinematical 59 
Line of the nodes 157, 309 
Linearization (of a system) 20 
Liouville theorem 68 
Lipshitz functions 16, 179 
Lissajous figures 30 
Longitude of the ascending node 157, 
313 

MacMillan problem 286ff 
Mars 2 

Mean motion 134 
Mean potential 302 
spherical 303, 307 
spheroidal 303 
triaxial 352 
Mechanical systems 

completely degenerate 112 
conditionally periodic 112 
multiperiodic 108 
Mercury 2 
Mixed terms 9 
Moon 4ff, 265, 271, 293-299 
motion of 293ff, 383 

Natural system 40 
Neptune 200 
Newton 

law of universal gravitation 4, 126 
Principia 2, 4-6 
Newton-Raphson method 155 
Noether’s theorem 43ff 

and the integrals of the motion 
in the A/'-body problem 188ff 
and the L-R-L vector 136ff 
and the N'-dimensional oscillator 50 
non-conservative system 20, 47 
non-holonomic systems 32 



Orbits 1, 27 
box 356 
closed 314 
elements of 157ff 
periodic 314 
relative 127 
tube 357ff 
Oscillatory motions 

in the A^-body problem 207 

Painleve property 123 
weak 124 

Parabolic coordinates 331 
Pendulum 102 
linearized 106 
Pericentre 133, 315 
Perigee 2 

Perihelion, advance of 146 
Periodic function 107 
Perturbation theory 9 
Phase curve 16-17 
Phase fiow 18 
Phase space 16, 65 
extended 71, 85ff 
Planar solutions 217 
Poincare 
elements 279 
integral invariants of 69 
recurrence theorem 71 
Point transformation 43 
Poisson brackets 73 
Poisson’s theorem 74 
Precession 314 
Pseudocollisions 194 
Ptolemaic system 3 

Quasi-symmetry 50 

Rectilinear solutions 219 
Regularization 162 

two-body problem 164ff 
three-body problem 
Sundman 246-248 
Levi-Civita 248-256 
restricted three-body problem 279ff 
three dimensional restricted 
problem 289-291 

Relative equilibrium solutions 220 
Resonance 10 

Restricted three-body problem 257ff 
three-dimensional 284ff 
Ricci tensor 372 
Riemann curvature tensor 371 




392 Subject Index 



Riemannian metric 33 
Rotation 111, 355 
Routh 

function 38, 130 
Routh (cont.) 
value 271 

Saturn 2, 200, 271 
Secular terms 9 
Separability 92, 332 
in the Kepler problem 139 
Singularities 163 

in the A^-body problem 192ff 
Sitnikov problem 208, 286, 292 
Small divisors 10 
Solar System 200 
Space of events 42 
Spheroidal coordinates 331 
Stability 23-25 
effective 286 

Stackers theorem 92, 332-334 
Stationary action principle 
see Hamilton’s principle 
State space 65 



Sun 2, 265, 271, 284 

Sundman’s theorem 198ff 
Surface of section 11 
Synodic system 257 

Syzygy 218 

Tangent* space 65 
Tauber ian theorem 201 
Three-body problem 238ff 
Tisserand’s criterion 259 
Toda lattice 117 
Torus 27 

rational (resonant) 10, 112, 
305 

Trojans 271 

Two-body problem 126ff 
Uranus 200 
Venus 2 

Virial theorem 209ff 
Von Zeipel’s theorem 194 





LIBRARY 



ASTRONOMY AND 
ASTROPHYSICS LIBRARY 



Series Editors: I.Appenzeller • G. Bomer • A. Burkert • M.A. Dopita 

T. Encrenaz • M. Harwit • R. Kippenhahn • J. Lequeux 
A. Maeder • V. Trimble 



The Stars By E. L. Schatzman and F. Praderie 
Modern Astrometry 2nd Edition 
By J. Kovalevsky 

The Physics and Dynamics of Planetary 
Nebulae By G. A. Gurzadyan 

Galaxies and Cosmology By F. Combes, P. 
Boisse, A. Mazure and A. Blanchard 
Observational Astrophysics 2nd Edition 
By P Lena, F Lebrun and F Mignard 
Stellar Interiors. Physical Principles, 
Structure, and Evolution 
By C. J. Hansen and S. D. Kawaler 

Physics of Planetary Rings Celestial 
Mechanics of Continuous Media 
By A. M. Fridman and N. N. Gorkavyi 

Tools of Radio Astronomy 4th Edition 
By K. Rohlfs and T. L. Wilson 

Astrophysical Formulae 3rd Edition 
(2 volumes) 

Volume I: Radiation, Gas Processes 
and High Energy Astrophysics 
Volume II: Space, Time, Matter 
and Cosmology 
By K. R. Lang 

Tools of Radio Astronomy Problems and 
Solutions By T. L. Wilson and S. Hiittemeister 
Galaxy Formation By M. S. Longair 

Astrophysical Concepts 2nd Edition 
By M. Harwit 

Astrometry of Fundamental Catalogues 

The Evolution from Optical to Radio 

Reference Frames 

By H. G. Walter and O. J. Sovers 

Compact Stars. Nuclear Physics, Particle 
Physics and General Relativity 
By N. K. Glendenning 

The Sun from Space By K. R. Lang 
Stellar Physics (2 volumes) 

Volume 2: Stellar Evolution and Stability 
By G. S. Bisnovatyi-Kogan 



Theory of Orbits (2 volumes) 

Volume 1 : Integrable Systems 
and Non-perturbative Methods 
Volume 2: Perturbative 
and Geometrical Methods 
By D. Boccaletti and G. Pucacco 

Black Hole Gravitohydromagnetics 

By B. Punsly 

Stellar Structure and Evolution 

By R. Kippenhahn and A. Weigert 

Gravitational Lenses By P Schneider, 

J. Ehlers and E. E. Falco 

Reflecting Telescope Optics (2 volumes) 
Volume I: Basic Design Theory and its 
Historical Development. Second Edition 
Volume II: Manufacture, Testing, Alignment, 
Modem Techniques 
By R. N. Wilson 

Interplanetary Dust 

By E. Griin, B. A. S. Gustafson, S. Dermott 
and H. Fechtig (Eds.) 

The Universe in Gamma Rays 

By V. Schonfelder 

Astrophysics. A Primer By W. Kundt 
Cosmic Ray Astrophysics 
By R. Schlickeiser 

Astrophysics of the Diffuse Universe 
By M. A. Dopita and R. S. Sutherland 

The Sun An Introduction. Second Edition. 

By M. Stix 

Order and Chaos in Dynamical Astronomy 

By G. J. Contopoulos 

Astronomical Image and Data Analysis 

By J.-L. Starck and F. Murtagh 

The Early Universe Facts and Fiction 
4th Edition By G. Bomer 

The Solar System 4th Edition 

By T. Encrenaz, J.-P. Bibring, M. Blanc, 

M. A. Bamcci, F. Roques, Ph. Zarka 




